Set Items Description 

51 4669973 INTELLECTUAL () PROPERT? OR PATENT? OR COPYRIGHT? OR TRADEMA- 

RK? 

52 20577 TRADE () DRESS? OR DESIGN () RIGHT? OR PROPRIETARY () INFORMATIO- 

N? 

53 2 LEGALLY 0 PROTECTABLE () KNOWLEDGE OR TANGIBLE () RESEARCH () PRO- 

PERTY 

54 1067220 DATABASE OR DATABANK OR DATA ( ) (BASE? OR BANK? OR FILE? OR - 

REPOSITOR? OR WAREHOUSE? ) OR DB OR RDB OR OODB OR ODBC OR DBMS 

55 10506358 NETWORK? OR NET? ? OR INTERNET? OR INTRANET? OR LAN? ? OR - 

WAN? ? OR ONLINE 

56 7077174 ETHERNET? OR EXTRANET? OR WWW OR WORLD () WIDE () WEB OR WORLD - 

WIDEWEB OR SUBNET? 

57 4963437 WEBSITE? OR WEB () SITE? OR WEBPAGE? OR WEB () PAGE? OR WEB()A- 

DDRESS? OR URL?? OR URI?? 

58 5522278 HOMEPAGE? OR HOME () PAGE? OR FRONTPAGE? OR FRONT () PAGE? OR - 

SITE? OR HTML () FILE? 

59 1096792 SI :S3 (10N) S4 : S8 

510 335256 SEARCH? OR RESEARCH? OR RETRIEV? OR INQUIR? OR QUERY? OR Q- 

UERIES OR EXAMIN? OR INSPECT? 

511 217257 REQUEST? OR DATA () MINE? OR DATA () MINING? OR DATAMINE? OR D- 

ATAMINING? OR FIND? OR DISCOVER? 

512 104776 INTERROGAT? OR WEBCRAWL? OR WEB () CRAWL? OR METACRAWL? OR M- 

ETA() CRAWL? OR SEEK? OR SORT? OR HUNT? 

513 3488 JAPIO OR JPO OR EPO OR USPTO OR WIPO 

514 472633 STORE OR STORING OR MEMORY OR ACCUMULAT? OR RECEIV? OR ACC- 

EPT? OR ACQUIR? OR OBTAIN? OR CULL? OR CACHE? 

515 296764 STOCK? OR COLLECT? OR GATHER? OR GLEAN? OR AMASS??? OR ACC- 

RU? OR AGGREGAT? OR COMPIL? OR SIFT? OR CACHING 

516 556334 PULL () DOWN? OR TAKE? OR STORAGE? OR TAKING? OR DERIV? OR P- 

ROCUR??? OR GET? OR TAP? ? OR CAPACIT? 

517 114549 CONVERT OR CONVERTS OR CONVERTING OR CONVERSION? OR TRANSF- 

ORM? OR ALTER??? OR REFORMAT? OR EXTRACT? 

518 364108 MODIF? OR REVIS??? OR TRANSLAT? OR REMODEL? OR ADAPT? OR C- 

HANGE OR CHANGE? ? OR CHANGING? OR COLLAT? 

519 625636 FIRST? OR 1ST OR PRIMARY OR INITIAL? OR ORIGINAL? OR LEADO- 

FF? OR MAIN OR CHIEF OR INTRODUCTORY? 

520 438559 SECOND? OR 2ND OR DOUBL? OR TWIN? OR EXTRA? OR DUPLICAT? OR 

ANOTHER OR SUBSIDIAR? OR AUXILIAR? 

521 349961 THREE? OR TRIO? OR TRIUNE? OR TRIAD? OR TRIPL? OR TERTIAR? 

OR THIRD OR 3RD 

522 19714 (S9 OR S13) (5N)S10:S12 

523 4776 S22 AND (S14:S16 AND S17 :S18) ( 10N) (SI : S8) 

524 3664 S23 AND (S17:S18 AND S19 : S21) ( 10N) (SI : S8) 

525 1673 S24 AND S17 : S18 (10N) (SI :S3 OR S14:S16) 

526 1474 S25 AND (S14:S16 AND S17:S18) (5N) (S1:S8) 

527 888 S26 AND SI : S3 (5N) S4 : S8 

528 881 S27 AND S1(5N)S4:S8 

529 225 S27 AND S1(5N)S7:S8 

530 659 S28 AND S1(3N)S4:S8 

531 117 S29:S30 AND S17 : S18 (10N) S19 : S21 AND SI ( 10N) (S7 : S8) 

532 39 S31 AND (S14:S16 AND S17:S18) (5N) (S19:S21 AND SI) (5N)S7:S8 

533 242 S29 OR S31:S32 

534 636696 PD>2000 

535 78 S33 NOT S34 

536 65 RD (unique items) 
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TEXT: 

. . . as well. 

This article is an overview of how the image retrieval software from 
these three companies works, based on demo image collections on their 
Web sites . Their software is usually licensed to third -party vendors 
for specific applications, unlike the software employed by the major search 
engines . But ... 

...image search engines, but it may have to do with the size of the image 
collection at its demo site . 

The most telling test searches were related to national flags (Figure 
1) . These are very... as the San Francisco Museum of Modern Art or the 
French Ministry of Culture, that want to offer searchable art 
collections by similarity measures. 

Although only three attributes (color layout, color percentage, and 
texture) can be used as search criteria for most... 

...coarseness, contrast, and presence or absence of directionality of lines 
in images. 

IBM 1 s demo site contains the two collections that demonstrate best 
the power of the software in finding similar images. One is the... 

...sound like high-tech software gadgets for artsy folks, but a look at 
IBM's trademark image collection on its demo site reveals how 
important similarity searching in business (and science) applications can 
be. Developing, introducing, and protecting a distinctive sign or... 

...the Coca Cola trademark, or the shape of the Continental and Nike logos. 

Among the first steps in developing a trademark is to find if 
there are similar logos or designs. This can be a costly... 

...40 per hour, but it is available only in the Virginia office. The CD-ROM 
trademark collection is rather expensive and excels in smart text 
searching. Consulting the trademark archive on microfiche... 

...the Web free of charge. It is possible that IBM will do it again with 
trademarks . This is important because while there are good patent 
databases online with sophisticated search facilities, trademark 
searching depends on structure and pattern recognition that none of the 
currently existing trademark databases offer... 

...is far more simple and efficient to search by design pattern. Although 
IBM's sample collection is very small (1,000 trademarks ), the test 



searches proved the excellent quality of the QBIC software as customized 
for trademark. . . 

...selected as the base figure. By clicking on the base image, its shape 
component is extracted and displayed to the left of the original 
trademark , and the software goes out to find images with similar shape 
components. Figure 4 shows .. .mostly done automatically, indexing of images 
will follow this model, especially for images stored on Web pages . 
Image collections are likely to use both human indexing and 
classification and automatic extraction of visual attributes to 
facilitate finding highly relevant images swiftly. As this happens, image 
collections . . . 
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Language: English Record Type: Fulltext 
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Word Count: 328 

(USE FORMAT 7 FOR FULLTEXT) 
TEXT: 

...developed by Manning & Napier Information Services (MNIS) is now being 
demonstrated via the Internet's World Wide Web ( WWW ). MAPIT, which 
the company describes as its " patent data mining tool" compares the 
contents of thousands of patents and automatically produces visualizations 
of the research results. The MNIS technology behind MAPIT was first 
developed for government intelligence agencies, and later adapted for use 
in major corporations and patent offices around the world, according to 
MNIS. The service is offered as either an outsourced product or for 
installation on a corporate network . MAPIT applies advanced technology to 
patent research by which large patent portfolios can be analyzed 
identify similarities, differences, and relationships. The research results 
can be visually... 

...a detailed analysis of possible infringement. The online MAPIT 
demonstration may be viewed at: http:// www .mnis. net /MAPITdemo or 
through a patent research service offered on the Internet WWW by 
IBM by following the "Resource Page" link at IBM's website : http:// 
patent .womplex.ibm.com/respage.html The IBM website is an electronic 
"front door" to a database of patents issued in the U.S. during the 
last 27 years, plus 17 years of patent images. The IBM site contains 
over 2 million patents . In addition to its sophisticated, natural 
language processing technology, MAPIT incorporates the same DB2 -based 
technology that IBM uses on its site to store over 100GB of patent 
data. MAPIT will be available as a value-added option to companies which 
purchase their. . . 
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IMAGES OF HISTORY CAPTURED BY MODERN TECHNOLOGY 
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Dec 15, 1993 

Language: English Record Type: Fulltext 
Document Type: Newsletter; Trade 
Word Count: 998 

ownership documents that date back as far as 1788. Federal land 
records --field notes, survey plats, patent records, tract books and case 
files--for the states bordering and east of the Mississippi... 

...million, six-year project is in its second year and is expected to scan 
and store more than seven million of BLM's Government Land Office (GLO) 
records. So far, GLO has processed more than 1.6 million documents. 
Expense. . . 

...age and size of the documents, particular care is exercised in the 
scanning process. The land patents , as they are called, are on 
parchment paper and measure approximately 14 X 20 inches... 
...Each scanner is capable of processing more than 1,250 images per day 
with each patent taking about ten seconds . 

After capture, the image is sent via a Novell network to an indexer, 
who extracts 37 items of information from the actual image. Data is 
indexed into fields and entered... 

...do any image twice," says Lamar McCown, DCI's project manager for the 
digital record conversion process. 

BLM researchers are now able to electronically access land patents 
from Florida, Louisiana, Arkansas, Ohio, Michigan, Wisconsin and Minnesota 
via NEC PowerMate 4 86SX/25e PC... 

...so records can be located in less than a minute. 

Retrieval requires less than 15 seconds . Access to the database 
costs $2.00 per query session minute and printed images cost $0.13 per 
page. 

On- site access improved, but off- site image retrieval even 
better 

Most patent requests come from title attorneys and abstract 
companies that before they insure a parcel of land . , . 

. . . the BLM for the patents . 

Rather than have investigators travel to Virginia to access a patent 
, the BLM is implementing a off- site retrieval and fax service so users 
can query their searches by modem. "The system will free... 



Set Items Description 

51 3235538 INTELLECTUAL () PROPERT? OR PATENT? OR COPYRIGHT? OR TRADEMA- 

RK? 

52 25677 TRADE ( ) DRESS? OR DESIGN 0 RIGHT? OR PROPRIETARY () INFORMATIO- 

N? 

53 2 LEGALLY () PROTECTABLE () KNOWLEDGE OR TANGIBLE () RESEARCH 0 PRO- 

PERTY 

54 1517121 DATABASE OR DATABANK OR DATA ( ) (BASE? OR BANK? OR FILE? OR - 

REPOSITOR? OR WAREHOUSE?) OR DB OR RDB OR OODB OR ODBC OR DBMS 

55 12447445 NETWORK? OR NET? ? OR INTERNET? OR INTRANET? OR LAN? ? OR - 

WAN? ? OR ONLINE 

56 6270238 ETHERNET? OR EXTRANET? OR WWW OR WORLD () WIDE () WEB OR WORLD- 

WIDEWEB OR SUBNET? 

57 3696931 WEBSITE? OR WEB () SITE? OR WEBPAGE? OR WEB () PAGE? OR WEB () A- 

DDRESS? OR URL?? OR URI?? 

58 5077209 HOMEPAGE? OR HOME () PAGE? OR FRONTPAGE? OR FRONT () PAGE? OR - 

SITE? OR HTML {) FILE? 

59 870949 SI : S3 (ION) S4 : S8 

510 261432 SEARCH? OR RESEARCH? OR RETRIEV? OR INQUIR? OR QUERY? OR Q- 

UERIES OR EXAMIN? OR INSPECT? 

511 185008 REQUEST? OR DATA ( ) MINE? OR DATA () MINING? OR DATAMINE? OR D- 

ATAMINING? OR FIND? OR DISCOVER? 

512 101224 INTERROGAT? OR WEB CRAWL? OR WEB () CRAWL? OR METACRAWL? OR M- 

ETA() CRAWL? OR SEEK? OR SORT? OR HUNT? 

513 4310 JAPIO OR JPO OR EPO OR USPTO OR WIPO 

514 378483 STORE OR STORING OR MEMORY OR ACCUMULAT? OR RECEIV? OR ACC- 

EPT? OR ACQUIR? OR OBTAIN? OR CULL? OR CACHE? 

515 237999 STOCK? OR COLLECT? OR GATHER? OR GLEAN? OR AMASS??? OR ACC- 

RU? OR AGGREGAT? OR COMPIL? OR SIFT? OR CACHING 

516 434143 PULL () DOWN? OR TAKE? OR STORAGE? OR TAKING? OR DERIV? OR P- 

ROCUR??? OR GET? OR TAP? ? OR CAPACIT? 

517 95843 CONVERT OR CONVERTS OR CONVERTING OR CONVERSION? OR TRANSF- 

ORM? OR ALTER??? OR REFORMAT? OR EXTRACT? 

518 306561 MODIF? OR REVIS??? OR TRANSLAT? OR REMODEL? OR ADAPT? OR C- 

HANGE OR CHANGE? ? OR CHANGING? OR COLLAT? 

519 526506 FIRST? OR 1ST OR PRIMARY OR INITIAL? OR ORIGINAL? OR LEADO- 

FF? OR MAIN OR CHIEF OR INTRODUCTORY? 

520 375558 SECOND? OR 2ND OR DOUBL? OR TWIN? OR EXTRA? OR DUPLICAT? OR 

ANOTHER OR SUBSIDIAR? OR AUXILIAR? 

521 296584 THREE? OR TRIO? OR TRIUNE? OR TRIAD? OR TRIPL? OR TERTIAR? 

OR THIRD OR 3RD 

522 19272 (S9 OR S13) (5N)S10:S12 

523 5534 S22 AND (S14:S16 AND S17 :S18) (10N) (SI : S8) 

524 4556 S23 AND <S17:S18 AND S19 : S21) (10N) (SI : S8) 

525 2354 S24 AND S17 :S18 (10N) (SI :S3 OR S14:S16) 

526 2154 S25 AND (S14:S16 AND S17:S18) (5N)S1:S8 

527 1550 S26 AND SI :S3 (5N) S4 : S8 

528 538 S27 AND S1(5N)S7:S8 

529 250 S28 AND S17:S18(10N) S19:S21 AND S1(10N)S7:S8 

530 96 S29 AND (S14:S16 AND S17:S18) (5N) (S19:S21 AND SI) (5N)S7:S8 

532 615440 PD>2000 

53 3 22 S30 NOT S3 2 

534 60 S29 NOT S32 

535 60 S34 NOT S32 

536 60 S33 OR S35 

537 44 RD (unique items) 
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ABSTRACT: 

...available on the desktops of corporate workers and institutional 
researchers to assist in the retrieval, extraction , and analysis of 
information. A Ph.D from the School of Information Studies at the... 

...she was instrumental in the development of DR-Link, the search engine 
for the MNIS online information service which was originally developed 
for military intelligence analysts. Liddy considers information analysis 
and knowledge extraction as the core business of TextWise. The company is 
developing CHronological Information Extraction System (CHESS) , a 
business tool for the information specialist involved in competitive 
intelligence. Article includes... 

TEXT: 

. . .Editor, Searcher Magazine 

where will tomorrow's Roger Summit come from? Who is designing the 
database management systems that will power information systems in the 
Third Millennium? We decided to talk to Elizabeth (Liz) Liddy of TextWise 
and Syracuse University, creator. . . 
...The key point is analysis. 

BQ: What direction do you think advanced search engines will take ? 

Liddy: Two ways -- either on intranets , the next big market, or as 
improved ways to access open sources, like traditional newspaper... 

...across potentially relevant documents. They need to see the themes, the 
relationships between documents. They want to hyperlink straight to the 
answer and get it "pre-read" for them. 

For example, one of the applications we're doing for... 

...of the content of documents. Term matching is still the same. They still 
haven* t extracted all the meaning intended in the document. For true 
natural language processing, you have to. . . 



.the documents. 



At thie point, we think it's too hard for some oldsters to change . At 
Textwise, we don't want to just sell an engine to librarians, we want 
to work with them as customers, look at problems, take a consulting role, 
and design products and services with full capability and precision. There 
are . . . 

. . .ve had some interesting discussions. We see our core business as 
information analysis and knowledge extraction , as building systems to 
track and visualize all the connections. When we build systems that... 

...so important. The other key thing is data mining, knowledge discovery, 
not juBt finding and extracting . 

We develop our technologies for the government and then commercialize them. 
Data mining for companies . . . 

...manual insertion of data. We can process 100,000 gigabytes of text and 
create a database automatically. We can extract data from text and move 
it into a different format, then let the data mining... 

...1998, we are going to offer a new syBtem for sale called CHESS 
(CHronological Information Extraction System). It's meant for the 
information specialist in a large agency doing competitive intelligence... 

...searchable quickly. Natural language processing has proven itself as 
offering great assistance to information retrieval, extraction , and 
analysis. It really works. It works fast. It works well. Adopt it. 

Once we . . . 

...Napier Information Systems (MNIS), Textwise 1 s commercial partner. 
Commercial products based on DR-LINK, the original Textwise information 
retrieval system, are now available through MNIS (http:// www . mnis.net), 
for example. Textwise currently has new products under development for 
information extraction , multilanguage retrieval, intelligent agents for 
finding information, and datamining of very large databases. Visualizers 
now. . . 

...the intellectual property assets of a potential acquisition. Use it in 
conjunction with the IBM patent search site Resource Page (http:// 
patent .wcomplex.ibm.com/respage.html) by choosing MAPIT. 

Custom hot sites : Organizations such as SCIP and AFCEA have established 
standing queries for their members on hot... 

...subtopics and find source documents for each subtopic. To see a 
demonstration, check out http:// www .afcea.org/drlink. 
Current 1998 Projects 

CHESS: an information extraction system which extracts information from 
an entire database and organizes it to track people, companies, 
countries, or organizations over time. CHESS also provides... 

...and preferences, resulting in improved precision. The system evolves 
automatically to match the user's changing interests. 



KNOW-IT: an integrated knowledge extraction system which takes raw 
text as input and outputs a structured knowledge base in any domain. The 
system includes navigation and collaboration tools for manual refinement of 
the extracted knowledge base. This data mining tool makes any recorded 
knowledge amenable to automatic conversion into... 
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(USE FORMAT 7 OR 9 FOR FULLTEXT) 
A New Patent Search Tool for the Internet s QPAT-US 
(QPAT-US is Questel-Orbit 's new database of full-text US patents from 
1974 -present; review of the product, along with practical tips on how to 
use. . . 

) 

ABSTRACT: 

QPAT-US is the new patent search tool for the Internet released by 
Questel-Orbit, QPAT-US, containing patent information from 1974 to the 
present, can be used by both the inexperienced and experienced patent 
searchers. While it directs the inexperienced searcher to the Main Search 
page for patent searches, it allows the experienced searchers to utilize 
fairly advanced search and display capabilities. Equipped... 

TEXT: 

by Nancy Lambert 
Information Analyst 

Chevron Research and Technology Co. 
photo omitted 

Patent searching on the Internet is a hot topic in the information 
world these days, even though the resources to. . . 

Internet patent * search resources took a big step forward with the 
introduction of QPAT-US, Questel-Orbit * s new database of full-text U.S. 
patents , 1974 to present. While this 110GB database is not free, 
subscriptions are priced competitively: unlimited... 

...Of interest to users of the PTO Internet database: Questel-Orbit has 
mounted a free database of front - page patent information with all 
the QPAT-US search features. And, Questel-Orbit has offered QPAT-US... 

...and happily browse through patent titles that the search produced, 50 
at a time. Experienced patent searchers can take advantage of some 
fairly sophisticated search and display capabilities. 



RELEVANCE RANKING 



All the searches you. 



...appears in a patent and weighs this against how common the term is in 
the database and the length of the patent compared to others in the 
database . If you're searching multiple terms, how many of them show up 
in a patent also affects its relevance. . . 

. . .other things being equal (including the frequency of the terms from both 
groups in the database ) , patents with 99 percent of term occurrences 
from one group and 1 percent from the other. . . 

. . .equally with patents with 50 percent of term occurrences from each 
group. Clearly, I would want to see the latter first. Questel-Orbit may 
want to adjust the factors that go into relevance ranking to take 
Boolean logic into consideration. 

photo omitted 

* Whenever I talk about free-text patent searching with. . . 

...searching is not currently useful, though, since the U.S. classes are 
those on the patents when they first issued. (See the "Wish List" 
section at the end of this article for more on. . . 

...in either direction of another group of search terms) --an important 
feature for full -text searching that search engines for other Internet 
patent databases don't offer. You can search multiword phrases by putting 
them in single quotes . . . 

...the search term box and adding or deleting parentheses, terms, and 
operators . 

You can also get online help in developing your search statement. The 
system automatically does what Questel-Orbit calls "search... 

...seriously skew the relevance ranking, since this is based partly on the 
frequency of a search term's occurrence in the database . If a 
particular misspelling occurs only once, the patent in which it appears 
will be thrown to the top of the relevance list. I... 

...search for "hypodermic," you'll find "syringe," "ampule," "cannula," 
and "needle" as statistically related. 

The third vocabulary aid, " Database Dictionary, 11 you use if your search 
term produced no hits. It shows you a list... 

...click on an answer set, you go to the View Set page and see an initial 
display of patent numbers and titles for the first 50 answers in 
relevance-ranked order. You may ask for the next 50, and so... 

...You may click on "expand view" to add two lines of abstract text to the 
first 50 titles. You may look through the titles and select patents of 
interest to order full copies, to put into a special "set 0" for further... 

...on the View Set page, you may do this Bame sort of process on the first 
five patents by clicking on "KWIC" (Key Word in Context) . At any point 
you can use Windows . . . 

...marked "similar" to see a menu of what QPAT-US calls "canned searches." 



Theee automatically extract data from the patent for use in new 
searches. You can search on the entire text (minus stop words) of the 
abstract or the main claim as a "natural language query" to get a new 
set of relevance-ranked patents . (I'm not sure just how useful this is, 
given how many totally irrelevant words are bound to show up in the 
abstract or main claim.) You can search on the patent ' s 
classifications, either the primary class or all classes, to find other 
patents in the same classes (but don't do this yet --see the first "Wish 
List. . . 

...variety of citation searches: forward (more recent patents that have 
cited this one) , backward (older patents cited in this one) , or " network 
" (a backward citation search followed by a forward citation search done 
on all the patents found in the backward... 

...how important up-to-date classes are for U.S. patent subject searching, 
since older patents are reclassified with new classes when these change 
(3) . Even if patent classifications are not the main point of this 
text-oriented database , they can be very useful both by themselves, if a 
good subclass exists for the... 

...page drawings and chemical structures. HTML provides for this 
possibility. 

One of the joys of patent searching online is being able to go quickly 
and easily between files, taking patent references from one database 
into another for further search refinement or viewing. QPAT-US is 
isolated now, with no easy links into Questel's... 

...would like to be able to start a search in, say, the merged API-World 
Patents Index database and then cross U.S. patents into QPAT-US for 
some full-text search refinements (or vice versa) . I also want to be able 
to move U.S. patents that I found from searching the indexed databases 
into QPAT-US to view them using its text-scanning capabilities. Ideally, 
the interface would be completely transparent from both the databases and 
the Internet 9ide. 
photo omitted 

When I'm searching QPAT-US by patent number, either directly or by 
transferring patents from other databases, I want to provide keywords 
or character strings that the software will use to create "found terms" at 
the top and highlighted text throughout each patent . 

I would like to see some changes in "fuzzy logic." I very much want to 
see postings listed beside the terms, to help me choose between 
one- time-only. . . 

...would give it a definite edge, making it one-stop shopping for much 
U.S. patent searching . 

My major wish is based more on Internet than QPAT capabilities. I wish 
everything would go faster! Several operations that I tried had... 

...customers a lot for that work time. This doesn't allow for a search that 
takes an hour on the Internet , which I could have done in ten minutes 
on a commercial database. 



Paul Albert of. 



...who led the design team for QPAT-US, wishes that customers would hurry 
up and get more powerful computers and Internet browsers, so that he 
could write operations to take advantage of their graphical interfaces. 
The most up-to-date systems have a lot of... 

...to manipulate information and find and display data. 

CONCLUSION 

I am still not an enthusiastic Internet -for- patent - searching 
advocate, unlike some others who are writing in the field (4,5). Nothing 
that I . . . 

...to give them access to these indexed databases. QPAT-US is essentially a 
free-text patent database , but it does take free-text searching and 
scanning to higher levels than anything else out there, either on... 
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...TEXT: 14 sectors that find patents to be an important information source 
is correlated with the patent propensity rate in each sector. The second 
part of the study is based on a combined survey and interview study of 
Dutch SAIEs in five high- technology sectors. The results show that SMIlEs 
mainly use patent databases to acquire information, often for legal 
purposes, that is not available from any other source. In contrast... 

...and expertise. This points to the need for simpler and more efficient 
methods of searching patent databases. 

Introduction 

In order to receive a patent in Europe, the US and many other 
countries, an applicant must disclose enough information about... 
...legal purposes connected to the management of intellectual property. For 
example, a firm can search patent applications to determine if a planned 
innovation project might infringe another firm's patents , to check if 
other firms are infringing its own patents, to challenge a competitor's... 

...applications and granted patents, while the US Patent and Trademark 
Office (PTO) contained several million patent records. IBM has 
established a free Internet site that contains the front page , 
abstract and other information for two million PTO patents issued since 
1971. Firms can use several keywords to search the IBM site , identify 
patents of interest and then immediately order a copy of the complete 
patent for a small . . . 

...here as firms with fewer than 500 employees, do not use patent data. A 
recent EPO survey of a sample of research and development (R&D) 
performing firms reports that only 14% of firms with fewer than... 

...used patent databases. 7 This survey, unlike the others, also asked the 
respondents about four main reasons for consulting patent databases: to 
develop new products (or processes) , to follow the activities of 
competitors, to prevent the duplication of R&D and to patent a new 
invention. Approximately 38% of the respondents that searched patent 
databases used the information. . . 

...asked why SMEs do not use patent databases. 

The failure of many SMEs to use patent databases is taken up by the 
recent Green Paper on innovation by the European Commission^ which 
comments ... to innovative firms, because firms that do not innovate are 
considerably less likely to find patent data of use. The first method 
consists of an analysis of the relevant results of the 1993 Community 
Innovation Survey. . . 



...to patent information of value for technical, competitive or legal 
reasons. The CIS did not obtain data on whether or not each firm applied 
for a patent , but ordinal data on the importance of patents as a method 
of profiting from innovation. . . 

...sources and for the relationship between firm size and the percentage of 
firms that use patent data. The main advantage of the EPO compared to 
the CIS survey is that the EPO study contains .patent disclosures as 'very 
important' or 'crucial 1 sources of information with estimates of the 
sectoral patent propensity rates. The latter are obtained from the PACE 
survey of Europe's largest R&D performing firms because the equivalent... 
...increases the probability that the firm will give a higher rating to the 
value of patent databases as an information source. 

The first three variables define fundamental characteristics of the 
firm. They consist of the log of the number. . . 

...a higher percentage of R&D performing firms than non-R&D performing 
firms use patent databases. 

The second set of variables measure the specific strategic environment of 
the firm. The proportion of sales ... intensity, and presumably exposed to 
greater competition, could be more likely to benefit from using patent 
databases to gather information about what their competitors are doing. 

The second and third logit models include the... 

. . . they encompass a range of new technologies that could be at the 
forefront of any changes in the importance of patenting . Two of these 
sectors, IT and instruments, are similar to the computing and instruments 
sectors. . .33.3% among the target population of 'possible' innovators. 

Detailed information on the use of patent databases was obtained 
through semistructured interviews with firms from each of the five sectors 
of interest that had. . . 

...applicant IT firms. 

These results are consistent with and support the CIS analyses and the 
findings of the EPO study. Patent applicant firms (or firms that 
probably apply for patents because they find patents... 

...information source or to search patent databases. This indicates that 
many firms do not search patent databases to obtain technical 
information. If they did, we would expect little difference in the use rate 
between. . . 

...the firm's patent rights; for instance, to ensure that the firm is not 
infringing another firm's patent or to obtain the necessary 
information on the prior art to draw up its own patent applications. The... 

...explored more closely in the interviews. 

Interview Results 

The interview results show that SMEs use patent databases for three 
basic reasons. The most important is for legal matters, as suspected given 



the higher use. 



...to obtain technical information for use in innovation. For example, each 
firm usually had a main reason and one or more secondary reasons for 
using patent databases. Obtaining technical information was the 
primary reason for searching patent databases for only tAo of 24 
interviewed firms that used patent databases, compared to 14... 

...on their competitors. However, slightly more than half of the 22 firms 
that did not obtain technical information from patent databases as a 
primary goal stated that this was a secondary reason for their use. 

The interviews showed that there is a simple explanation for why. . . 

. . .primarily use them to obtain information that is not available from any 
other source. 

The main cost-related obstacle cited by the respondents to using patent 
databases was the time and expertise required to conduct an adequate 
search, while access costs... 

...not search them in-house but employ external consultancy services or 
innovation centres to conduct patent searches for them. The other firms 
search patent databases themselves, three via on-line services, but 
several of these also use external consultants. 

(Graph Omitted) 

Captioned as: Figure 4 

Another reason why firms might not use patent databases to acquire 
technical information is the delay, in most countries, of 18 months between 
the submission of . . . 

...reason, the interviewed respondents were asked if they thought that the 
technical information in a patent database was up to date and of value, 
with one exception, most of the respondents f rom. information disclosed in 
patents was no longer useful because of the rapid rate of technical change 

in their field. Their primary reason for patenting in the first 
place was to be able to trade technologies through cross -licensing, rather 
than an interest. . . 

...this source of information was often based on two rational assessments 
of their usefulness. 

The first reason is linked to the patenting strategy of firms. Firms 
are less likely to apply for a patent if the cost... 

...potential or actual technology 'trading* rather than as a means of 
earning licence revenue. 

The second reason why SMEs seldom use patent databases to acquire 
technical information is because of the high costs of searching patent 
databases compared to much easier and cheaper alternative sources of 
technical information. Consequently, SMEs primarily use patent databases 
to acquire information that is not available from any other source; for 
example, to search the prior. . . 



...would invest in costly research to 'reinvent the wheel 1 . Unfortunately, 
current databases such as the Patent Register of the BPO are relatively 
simple to search to obtain information of value for legal purposes, but 
of no use as a source of technical . . . 

...for technical information. Several respondents commented that less 
expensive and time-consuming methods of accessing patent databases for 
example, via the Internet -would probably increase their use of these 
databases. This ...to reduce the time required to find something useful. 
Part of the problem is that patent databases must contain a substantial 
amount of similar and duplicative information. Software that could limit 
this repetition could be of great value. It may also be worth preparing and 
disseminating more information on how to use the Internet to access 
patent databases. A few of the interviewed managers, for example, did not 
know that several patent * . . 
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... quite happened yet. 

* The United States Patent and Trademark Office (USPTO) is mounting a 
free database of full- text U.S. patents , 1976-date, starting in 
November. 

* The European Patent Office (EPO) is mounting a free database of 
its member countries' patents , starting in October. 

* The IBM Patent Server will cease to exist on November 1, to... 

...in effect, writing about the future. Watch this space for more 
information as it develops. 

Online Patent Databases 

But first , what's new in online patent databases. 

IFI/Plenum Data Company: IFI/Plenum Data's big news is that the Dutch 

...in research- level scientific publishing. Harry All-cock, Vice President 
of IFI/Plenum Data, assures patent searchers that the change in 
ownership won't affect IFI databases and services. 
IFI has added U.S. patent... 

...dates to the CLAIMS databases. Searchers requested this because of 
complications brought about by the change in U.S. patent law on June 8, 
1995. Patents filed after that date are good for 20 years from date of 
first U.S. filing; but patents in force at that time are good for 20 
years from date of filing or.-. . 

...patents. Part of their excuse for theBe omissions has to do with Project 
Phoenix, their change in internal processing designed to speed up the 
passage of patents through abstracting and indexing and to get them 
online faster. Apparently, adding equivalent abstracts and patent 
citations took too long. In practice, Derwent probably wants to force 
customers to search patent... 

...and have added European and U.S. so far. They are indeed making 
progress. British patents now take just 15 days from arrival at Derwent 
to get full records with complete indexing online . U.S. patents take 

51 days as of Derwent week 9,832 (32nd week of 1998), down from 84... and 
examine full text on QPAT-US as usual, and then link to the IBM site to 
look at the patent images. One benefit: he IBM site does not see who is 
asking for the patents ; it sees only that the request comes from 
QPAT-US. This should help solve some security issues with the IBM site. 
(See the discussion later in this article on the new, improved IBM site .) 

INPI, the French Patent Office, has taken over management of all 
Markush chemical structure files, merging MPHARM (the pharmaceutical file 
that INPI . . . 



...STN/CAS: STN has introduced some new systems commands that will help 
customers who do patent multi-file searches and patent statistics. 
These are: 

1. "TRANSFER": Extracts and maps up to 50 r 000 pieces of data, e.g., 
patent numberB, from file... 

...perform these two functions.) 

3. "TABULATE": Creates, among other things, two-dimensional tables of 
data extracted from a set of patents , for instance patents by company 
by year for a technology of interest. My STN contact told me that... 
...extended family members (complex priority relationships). More details 
later, when CAS produces documentation for the online file. 

3. What percent of patent records will, in fact, include family 
information? 

Answer: Coverage is nearly complete from the early... free Internet 
databases of full-text U.S. patents, 1976 to date, and U.S. trademarks . 

The full -text patent database will use BRS Search software from 
Dataware, rather than the CNIDR software used on the current AIDS and U.S. 
patent frontpage (PatBib) databases. However, it will look like and have 
many of the same search features... 

. . . only . 

The AIDS database gives you the ability, via icons at the top of 
every patent front page , to search for similar patents : more in 
the same class, more by the same inventor or assignee, more "like this... 

...won't exist in the full-text database. 

Both the current databases provide date range searching . The 
full-text database will definitely allow date comparisons (all patents 
before/after a given date) ; it might or might not allow ranging between 
dates . 

The initial release of the full-text file will not have current U.S. 
patent classifications, but the PTO will add these later. 

The initial release will definitely not include proximity 
searching; that is, looking for one term within so... 

...partly because the PTO has made some concessions so as not to compete 
with commercial patent database producers and online hosts. Also, 
while the search engine does support proximity searching, the capability 
requires more storage and processing resources than the. . . 

. . .Information Dissemination (wes.gewehr®uspto.gov) . 

The PTO will mount the text portion of the patent database in 
November and add images next March. Low- resolution images will be free; 
high-resolution. . . 

...the AIDS database will go away; but the PTO will put one or more canned 
searches on the full -text database to produce subsets of AIDS patents . 

Elsewhere in the PTO: Keep aware of some problems with U.S. 
provisional applications, available . . . 

...provisional filing to a normal filing within the one-year time allowed. 
The U.S. patent does not include the provisional filing information on 
its front page , nor does the PTO include it in the tapes that it sends 
to patent database producers, so these family members are often not 
connected up. The PTO is aware of the problem, as are the database 



producers; a solution should be available soon. 
The European Patent Office 

Last year, the EPO announced changes in their data distribution and 
pricing policies. As part of a plan to maximize access... 

...country. Individual countries may choose to add additional coverage at 
their own expense. The EPO originally named this project DIPS 
(Distributed Internet Patent Information Services) , but recently 
changed the name to Esp@cenet. The sites have very limited search 
capabilities, but they do let searchers link to images of the... 

...the EPO Web site and investigated the databases, I found some very 
interesting inconsistencies. 

To get to the patent databases, you click on the Espocenet logo 
at the bottom center of the EPO home page . You immediately see two 
options: "Access via the European Patent Office" and "Access via the 
national offices of the member states." When you access via the European 
Patent Office, you then have four choices: Search in BPO , PCT, 
worldwide, or Japanese patents. When you access via the national offices of 
the member... in other cases, nothing. Most interesting, though: In many 
cases when I clicked on a patent , what I actually saw was an equivalent 
from another country, usually EPO or U.S. 

I r ll be interested in hearing what other searchers observe as they 
explore this international patent search site . 

Incidentally, the EPO home page has well -designed mechanisms for 
learning about, and getting around, the site . If you click on the 
navigational icon (the spinning EPO logo in the center of the page) , the 
system will guide you. Give it a try! 

The EPO site also has one of the best compilations of patent 
information resources on the Net . It links you to the patent offices of 
EPO member states and other countries, to patent information providers' 
and online hosts' home pages , to Internet patent databases, to 
patent -related mailing lists and news groups (including PIUG, the Patent 
Information Users Group), and lots more. How easily you can get to this 
resource from the EPO home page seems to vary over time. As of 
mid-October, you can click on "contact links... 

...right of the EPO home page. But you can also access it directly at 
http:// www .european- patent -office.org/ online /index. htm. 
Other International Patent Databases on the Internet 
Independent of the EPO, other countries are starting to put varying 
amounts of patent information free on the Internet . The Canadian 
Patent Office has mounted patents from 1989 to date, searchable only 
by title text. The French Patent Office has mounted patents from 1996 to 
date . . . 

...These data-bases won't replace DWPI or Inpadoc any time soon. 
The New IBM Patent Sites 

By the time you read this, the IBM Patent Server will be gone, 
replaced by two new patent information sites . (That phoenix analogy 
keeps coming up.) These are the IP Network (free) and the IP Network for 
Business (fee) . The free site will have all the features of the old 
Patent Server plus some nice new features, including 20 new patent fields 
to search, date ranging, and the ability to page through search results. 
IBM is also adding European patents to the free site , EPO-A from 1978 
to date and EPO-B ESPACE-A and ESPACE-B CD-ROM products. And, IBM is adding 
PCT patents : bibliographic information, abstracts, and front - page 



images from 1997 to date, and full document images from 1998 to date. 
The Business . . . 

. . . try them out . 
Conclusion 

I've written a lot in the last few years about patent resources on 
the Internet . As you can see, they're still growing fast, and they 
increasingly include international as... 

...no convenient access to patent information. But all the caveats that 
I've issued about Internet patent freebies still apply. If you need 
indepth subject access across multiple databases covering multiple 
countries' patents , you still need the commercial indexed patent 
databases. Sometimes you get what you pay for. 



Set Items Description 

51 8377481 SEARCH? OR RESEARCH? OR RBTRIEV? OR INQUIR? OR QUERY? OR Q- 

UERIES OR EXAM IN? OR INSPECT? 

52 3144275 REQUEST? OR DATA () MINE? OR DATA {) MINING? OR DATAMINE? OR D- 

ATAMINING? OR FIND? OR DISCOVER? 

53 582521 INTERROGAT? OR WEBCRAWL? OR WEB () CRAWL? OR METACRAWL? OR M- 

ETA() CRAWL? OR SEEK? OR SORT? OR HUNT? 

54 897634 INTELLECTUAL () PROPERT? OR PATENT? OR COPYRIGHT? OR TRADEMA- 

RK? 

55 937 TRADE 0 DRESS? OR DESIGN () RIGHT? OR PROPRIETARY () INFORMATIO- 

N? 

56 0 LEGALLY () PROTECTABLE () KNOWLEDGE OR TANGIBLE {) RESEARCH () PRO- 

PERTY 

57 9595 JAPIO OR JPO OR EPO OR USPTO OR WIPO 

58 979283 DATABASE. OR DATABANK OR DATA ( ) (BASE? OR BANK? OR FILE? OR - 

REPOSITOR? OR WAREHOUSE?) OR DB OR RDB OR OODB OR ODBC OR DBMS 

59 4246060 NETWORK? OR NET? ? OR INTERNET? OR INTRANET? OR LAN? ? OR - 

WAN? ? OR ONLINE 

510 162661 ETHERNET? OR EXTRANET? OR WWW OR WORLD () WIDE () WEB OR WORLD - 

WIDEWEB OR SUBNET? 

511 314756 WEBSITE? OR WEB()SITE? OR WEB PAGE? OR WEB () PAGE? OR WEB()A- 

DDRESS? OR URL?? OR URI?? 

512 2198390 HOMEPAGE? OR HOME () PAGE? OR FRONTPAGE? OR FRONT () PAGE? OR - 

SITE? OR HTML () FILE? 

513 9460743 STORE OR STORING OR MEMORY OR ACCUMULAT? OR RECEIV? OR ACC- 

EPT? OR ACQUIR? OR OBTAIN? OR CULL? OR CACHE? 

514 3145237 STOCK? OR COLLECT? OR GATHER? OR GLEAN? OR AMASS??? OR ACC- 

RU? OR AGGREGAT? OR COMPIL? OR SIFT? OR CACHING 

515 8982077 PULL () DOWN? OR TAKE? OR STORAGE? OR TAKING? OR DERIV? OR P- 

ROCUR??? OR GET? OR TAP? ? OR CAPACIT? 

516 5054689 CONVERT OR CONVERTS OR CONVERTING OR CONVERSION? OR TRANSF- 

ORM? OR ALTER??? OR REFORMAT? OR EXTRACT? 

517 8874634 MODIF? OR REVIS??? OR TRANSLAT? OR REMODEL? OR ADAPT? OR C- 

HANGE OR CHANGE? ? OR CHANGING? OR COLLAT? 

518 9681972 FIRST? OR 1ST OR PRIMARY OR INITIAL? OR ORIGINAL? OR LEADO- 

FF? OR MAIN OR CHIEF OR INTRODUCTORY? 

519 8017483 SECOND? OR 2ND OR DOUBL? OR TWIN? OR EXTRA? OR DUPLICAT? OR 

ANOTHER OR SUBSIDIAR? OR AUXILIAR? 

520 6018468 THREE? OR TRIO? OR TRIUNE? OR TRIAD? OR TRIPL? OR TERTIAR? 

OR THIRD OR 3RD 
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ysrtffact: challenge in lrTdexBjg unstructured hypertext databases is to 
ltomatically extract meta^a^ar^that enables structured searching using 
j(opic^J u) . y o n om i o o , c ^rctTrtvvents keyword ambiguity and improves the quality of 
searching and profile-based routing and filtering. Therefore, an accurate 
classifier is an essential component of... 

... documents having known topics. We experimented with pre-classif ied 
samples from Yahoo! and the US Patent Database . We have developed a 
text classifier that misclassif ies only 13% of the documents in the Reuters 
benchmark; this is comparable to the best results ever obtained . Our new 
classifier misclassif ied 36% of the patents , indicating that classifying 
hypertext can be more difficult than classifying text. Naively using terms 
in. . . 

...Descriptors: information retrieval ,* 
...Identifiers: structured searching ; ... 

. . . search quality. . . 
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Title: Revolution or evolution? The impact of the Internet , end user 
interfaces and new software on chemical and patent information 

Abstract: Looks at the Internet and what it offers searchers of 
chemistry and patent literature. I comment on some of the developments I 
see happening, or would like to see happen. We are seeing a lot of exciting 
changes taking place, and a lot of research is going on into new 
methods of searching , retrieving , storing and displaying chemical and 
patent information. Old ways are being broken or amended, and many 
newcomers are experiencing the joys of electronic information for the 
first time. Is it revolution or evolution? I would call it evolution, but 
evolution at a. . . 
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Abstract: The 1970s and 1980s saw the rise and rapid development of 
online textual databases, particularly those covering science and 
technology. Online patents files are amongst the most used of these 
databases, and are now accessed worldwide by both patent specialists and 
researchers . However, most online databases covering intellectual 
property are based on an original document or publication which 
includes highly meaningful drawings, chemical structures or diagrams. The 
rapid advances in image handling and distribution technology, such as 
document image processing (DIP), image extraction , CD-ROM, WORM discs, 
erasable optical discs, high speed laser printing, and the delivery of... 
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Title: DIALOGLINK and TRADEMARKS CAN -FEDERAL: pioneers in online images 
Abstract: DIALOG and Thomson & Thomson teamed up to produce one of the 
first commercial online databases containing both text and images. 
Thomson & Thomson, a trademark and copyright research firm, added 

trademark designs to its text file, TRADEMARKSCAN- FEDERAL. 

Simultaneously, DIALOG developed DIALOGLINK, version 1.20, a powerful and 
easy-to-use software to receive and display images online from 
mainframe computers. The author evaluates TRADEMARKSCAN image retrieval 

using DIALOGLINK 1.20 communications software. She discusses 

TRADEMARKSCAN 1 s composition, retrieval methods, file size, downloading 
and printing time and costs, and database applications. She also discusses 
software requirements and functions that are relevant to image retrieval . 
Suggestions for improvements to TRADEMARKSCAN and DIALOGLINK are also 
given . 
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Abstract: A application of synergetic neural network (SNN) for content 
based retrieval was developed that are robust against noise, partial 
occlusions and is capable of producing fast response to input queries - 
The SNN is a top-down self -organizing system, which incorporates many of 
the basic concepts of synergetics. The system enables to support affine 
invariant of input queries which are a partial version of the stored 
patterns. It was observed that the number of visual keywords do not 
increase even when new trademark images were added to the database . 
(Edited abstract) 11 Refs. 
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Strategy for information on patents in the era of networks 
ABSTRACT: Taking a general view of an environmental change in a 

distribution of information on patents that simplified access to such 
information is provided because more datacomes from sources and 
programs of software for retrieval are improved, it indeed seems to 
us thattoday is brought in the era of which anyone can obtain 
desirable information on patents . On theother hand, taking into 
consideration of a peculiar feature of the information on patents 
that errorfor retrieval is not permitted, the environmental change 
further provides difficulty of the access thatis caused by a lot of 
obtainable information, and a variety of media sources. From the 
pointsdiscussed above, it will be understood... 

...end users. That is, the information necessary for users is notfound by 
users, but a third party selects and customizes the information to be 
delivered to theusers. In this case, if... 

...to external sourceshas to be carried out, while a serious view of core 

competence is taken by companies themselves, (author abst.) 
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...including intellectual assets, whose value over the last seven years has 
increased seven- fold. The first step in putting a numeric value on an 
organization's intellectual property is the process of making the asset 
as tangible as possible. Ideas, processes, concepts/business intelligence, 
CAD drawings, database entries, procedure manuals, and patents ran b^ 
converted to archived docu ments, so organizations can better measure the *" 
Value 6t inteliectudl'adsets anff include them as tangible assets. IPAM is a 
patent management and product data tool with a collection of analysis 
tools and databases for organization and analysis of intellectual assets. 
SmartPatent Workbench is a desktop client package that allows customers to 
see, print, search , organize, and analyze patent documents. Dow 
Chemical, a user of IPAM, says quantification of intellectual assets can 
lead to. . . 

DESCRIPTORS: Engineering Documentation; Patents ; Record Management; 
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Set Items Description 

51 1701624 INTELLECTUAL ( ) PRO PERT? OR PATENT? OR COPYRIGHT? OR TRADEMA- 

RK? 

52 998 TRADE () DRESS? OR DESIGN () RIGHT? OR PROPRIETARY () INFORMATIO- 

N? 

53 0 LEGALLY () PROTECTABLE () KNOWLEDGE OR TANGIBLE () RESEARCH () PRO- 

PERTY 

54 219476 DATABASE OR DATABANK OR DATA ( ) (BASE? OR BANK? OR PILE? OR - 

REPOSITOR? OR WAREHOUSE?) OR DB .OR RDB OR OODB OR ODBC OR DBMS 

55 610309 NETWORK? OR NET? ? OR INTERNET? OR INTRANET? OR LAN? ? OR - 

WAN? ? OR ONLINE 

56 - 65259 ETHERNET? OR EXTRANET? OR WWW OR WORLD ( ) WIDE ( ) WEB OR WORLD- 

WIDEWEB OR SUBNET? 

57 91895 WEBSITE? OR WEB () SITE? OR WEBPAGE? OR WEB () PAGE? OR WEB()A- 

DDRESS? OR URL?? OR URI?? 

58 318662 HOMEPAGE? OR HOME () PAGE? OR FRONTPAGE? OR FRONT () PAGE? OR - 

SITE? OR HTML () FILE? 

59 31022 S1:S3(10N)S4:S8 

510 28367 SEARCH? OR RESEARCH? OR RETRIEV? OR INQUIR? OR QUERY? OR Q- 

UERIES OR EXAM IN? OR INSPECT? 

511 26058 REQUEST? OR DATA () MINE? OR DATA () MINING? OR DATAMINE? OR D- 

ATAMINING? OR FIND? OR DISCOVER? 

512 12078 INTERROG AT ? OR WEBCRAWL? OR WEB () CRAWL? OR METACRAWL? OR M- 

ETA() CRAWL? OR SEEK? OR SORT? OR HUNT? 

513 2317 JAPIO OR JPO OR EPO OR USPTO OR WIPO 

514 28865 STORE OR STORING OR MEMORY OR ACCUMULAT? OR RECEIV? OR ACC- 

EPT? OR ACQUIR? OR OBTAIN? OR CULL? OR CACHE? 

515 19756 STOCK? OR COLLECT? OR GATHER? OR GLEAN? OR AMASS??? OR ACC- 

RU? OR AGGREGAT? OR COMPIL? OR SIFT? OR CACHING 

516 28622 PULL () DOWN? OR TAKE? OR STORAGE? OR TAKING? OR DERIV? OR P- 

ROCUR??? OR GET? OR TAP? ? OR CAPACIT? 

517 21330 CONVERT OR CONVERTS OR CONVERTING OR CONVERSION? OR TRANSF- 

ORM? OR ALTER??? OR REFORMAT? OR EXTRACT? 

518 29339 MODIF? OR REVIS??? OR TRANSLAT? OR REMODEL? OR ADAPT? OR C- 

HANGE OR CHANGE? ? OR CHANGING? OR COLLAT? 

519 29833 FIRST? OR 1ST OR PRIMARY OR INITIAL? OR ORIGINAL? OR 

FF? OR MAIN OR CHIEF OR INTRODUCTORY? 

520 29767 SECOND? OR 2ND OR DOUBL? OR TWIN? OR EXTRA? OR - 

ANOTHER OR SUBSIDIAR? OR AUXILIAR? 

521 24715 THREE? OR TRIO? OR TRIUNE? OR TRIAD? OR TRIPLr 

OR THIRD OR 3RD 

522 5875 IC=G06F? 

523 2335 (S9 OR S13) (5N)S10:S12 

524 1920 S23 AND (S14;S16 AND S17 : S18) (10N) (SI : S8 ) 

525 1819 S24 AND (S17:S18 AND S19 : S21) (10N) SI :S8 * 

526 1582 S25 AND S17 : S18 (10N) (SI : S3 OR S14:S16) 

527 483 S26 AND S22 

528 62 S23:S27 AND S1:S3/TI 

529 1582 S25 AND S26 

530 1920 S24:S27 OR S29 

531 1814 S24 AND (S14:S16 AND S17 : S18 ) (5N) SI : S8 

532 1280 S31 AND SI : S3 (5N) S4 : S8 

533 405 S32 AND S22 

534 1280 S32:S33 

535 365 S3 4 AND (S14 : S16 AND S17 : S18 ) (5N) SI : S4 (5N) S5 : S8 

536 243 S35 AND (S14 : S16 AND S17 : S18 ) (5N) SI : S3 (5N) S5 : S8 

537 745 S27:S28 OR S33 OR S35:S36 

538 569 S37 AND S22 

539 13487 AD=2001:2005 

540 258 S38 NOT S39 

541 258 ID PAT (sorted in duplicate/non-duplicate order) 
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...SPECIFICATION using the replaced second search expression, and 
when no result is obtained from the new database , the first 
transmission means transmits the replaced second search expression to 
the patent information search apparatus . 

The above object can also be achieved by a patent information search 
apparatus according to claim 28, for searching a patent information 
database on the basis of an input search expression and outputting a 
search result, comprising: 

replacement means for replacing an input first search expression with 
a second search expression for searching the patent information 
database , on the basis of bibliographic information search contents of 
the search expression. 
The above object. . . 

...of claims 29 to 31. 

In the above search apparatus or method, the patent information 



database can be searched on the basis of information other than 
bibliographic information. 

Other features and advantages of the... 

..search apparatus 103; 

Fig. 7 is a flow chart for explaining translation processing of the 

patent information search apparatus 103; 

Fig. 8 is a view showing an example of a menu. . . 

..synthesis result of an English abstract and drawings; 

Fig. 15 is a block diagram showing another arrangement of a patent 
information search system according to the present invention; 

Fig. 16A is a view showing the a patent information search system 
and apparatus, and a control method therefor of the present invention 
will... 

. .to an embodiment of the present invention. A client terminal 101 is 
connected to a patent information search apparatus 103 through a 
network 102 (e.g., the Internet). This client terminal 101 is a terminal 
emulator which receives a command from the information search apparatus 
103 and displays it. The command has a... 

..filed in, e.g., Japan, and searches various databases (115, 116, and 
117) storing Japanese patent information and the like in response to a 
request from the client terminal 101. A... 

..charge for fee claiming to the user of the patent information search 
apparatus 103 . 

A translation section 111 requests to translate information 
obtained upon searching the various databases (115, 116, and 117) into a 
predetermined language. A storage... 

.the apparatus to a database server 113. 

This patent information search apparatus 103 incorporates a 
translation engine or tool {not shown) . The translation section 111 
issues the request to this translation. . . 

.communication line together with a text to be translated and receives 
the translation result. 

The database server 113 comprises a CPU 118 for controlling the 
entire database server 113, a ROM 120 storing control programs and the 
like for controlling the database server 113, a RAM 119 used as, e.g., 
a work area in executing a. . . 

.the like stored in the ROM 120, and the various databases (115, 116, and 
117) storing Japanese patent information and the like. 

The English abstract database 115 stores English abstracts 
corresponding to laid-open patent official gazettes as one of patent 
information. The full text search database 116 stores laid-open patent 

official gazettes (Japanese) and the like associated with patents 
filed in Japan. 

The database 117 stores various tables such as a synonym table 1201, 
a word replacement table 1202... 

.conversion table 1302, a free keyword table 1601, and an IPC replacement 
table 1602. 

The database server 113 shown in Fig. 1 is independent of the patent 



information search apparatus 103. However, the present invention is not 
limited to this. The arrangement and function of the database server 
113 may be integrated with the patent information search apparatus 103. 

According to the patent information search system of the present 
invention, a person. . . 

..than Japanese can easily search for information related to patents filed 
in Japan from the database server 113 using a terminal device set 
abroad or a foreign- language terminal device in Search M is used to 
search for patent information stored in the database (115, 116, or 117) 
on the basis of an arbitrary English word input by the user, and has a 
menu layout as shown in Fig. 9A. The "Expert Search " is used to search 
for patent information stored in the database (115, 116, or 117) after 
the search range is narrowed down to, e.g., search... 

..to laid-open patent official gazettes as one of patent information. The 
full text search DB 116 stores laid-open patent official gazettes 
(Japanese) and the like associated with patents filed in Japan. 

The English abstract DB 115 and full text search DB 116 are, e.g., 
relational databases and have table layouts as shown in Figs... 
..example, not only databases associated with patents but also databases 
of utility models, designs, and trademarks can also be handled. The 
database language is not limited to Japanese. 

Referring back to Fig. 2A to explain the control... 

..returns the search result to the terminal 101. 

The search result is received from the patent information search 
apparatus 103 (step S207) . As shown in Fig. 10A, the received search 
result . . . 

..searching the full text search DB 116 is selected in step S204, a 
laid-open patent official gazette corresponding to the clicked 
publication number 1002 can be obtained in a display. . . 

..To switch the English abstract 1013 displayed on the monitor to a 
Japanese laid-open patent official gazette, the "CHANGE " button 1010 is 
clicked to display the laid-open patent official gazette 1014 
corresponding to the displayed laid-open patent official gazette 1014 on 
the monitor. 

The pieces of information obtained by searching the patent 
information search apparatus 103 include not only English information as 
shown in Fig. 10B but... 

..translation engine incorporated in the patent information search 
apparatus 103) to translate the laid-open patent official gazette 
described in Japanese into a language that can be understood by the user 
...search apparatus 103 to estimate the translation fee or order 
translation. 

In addition, in the patent information search system of the present 
invention, when a foreign country has the priority right... 
TO-ZAI-NAN-BOKU KOGYO in Japanese. However, search of patent databases 
based on this translated word is meaningless. In this embodiment, a 
search word is converted into another search word. . . 

..the option for searching the full text search DB 116 is selected, a 
laid-open patent official gazette corresponding to the clicked 
publication number 1002 is transmitted to the client terminal... 



...table 1102 is acquired from the full text search DB 116. 

By looking up the conversion table 1302 shown in Fig. 13B, Japanese 
character strings representing the non-textual contents such. . .recognized 
character strings are replaced with corresponding English character 
strings by looking up a predetermined conversion table. 

Next, the acquired English abstract 1401 and image data 1405 in which 
Japanese character strings representing the non. . . 

...information search apparatus 103 calculates an estimate of the 

translation fee for a laid-open patent official gazette requested by 
the user in step S215 {Fig. 2B) (step S701) . The calculated fee estimate 
is transmitted to the client terminal 101 (step S702)-. When a formal 
translation request is received from the user who had confirmed the 
fee estimate (step S704) , the patent information search apparatus 103 
requests a translation agency or the like to translate the laid-open 
patent official gazette designated by the user using, e.g., an e-mail. 

That is, not the patent information search apparatus 103 but a 
translation agency well versed in various technical fields is charged 
with translation, with this arrangement, the load on the patent 
information search apparatus 103 can be reduced, and in addition, the 
translation accuracy increases as . . . 

...patent information search apparatus 103, as described above. 

(Modification of First Embodiment) 

The above-described patent information search system of the first 
embodiment can be modified to ah arrangement shown in Fig. 15. The Bame 
reference numerals as in Fig. 1... 

...the relay server, so the search apparatus 103 or database server 113 
need not be changed . Hence, the development cost for constructing the 
system of the present invention is minimized. In... 

...thereof, when a user who cannot understand Japanese is to use a database 
storing Japanese patent information and the like, he/she can obtain 
desired information without being conscious of Japanese. 

( Second Embodiment) 

In the second embodiment, an example in which the patent information 
search system described in the first embodiment is applied to PATOLIS 
provided by Japan Patent Information Organization will be described. 
The arrangement and some processing operations of the patent 
information search system of the second embodiment are the same as in 
the first embodiment, and a detailed description thereof will be omitted. 

The PATOLIS is a commercial database in which information associated 
with patents filed in Japan and the like can be searched for. This 
PATOLIS is constituted such. . .as the number of laid-open patent official 
gazettes containing a character string can be obtained . 

However, since technical terms which can be used for fee keyword search 
are determined in... 
...patent system between countries cannot be solved by word translation. 
More specifically, in accessing a patent database in a certain 
country using another language, although the communication protocols 
for access can be matched, the patent formats cannot be... 

...a search expression or keyword in the language of the country (Japanese) 



having the corresponding patent database is created, no search errors 
occur. 

As described above, according to the second embodiment, by applying 
the present invention to a commercial database such as the PATOLIS, 
even a user who cannot understand Japanese can easily use the commercial 
database such as the PATOLIS. 

As described above, according to the first and second embodiments, 
when a user who cannot understand Japanese wants to use a database 
storing Japanese patent information and the like, he/she can obtain 
desired information without being conscious of Japanese. The present 
invention can also be applied to various database search systems. 

(Other Modifications ) 

In the above embodiments of the present invention, a case wherein a 
user whose native language is English uses the database storing Japanese 
patent information and the like has been described. However, the present 
invention is not limited to. . . 

. .a case wherein a user whose native language is French uses a database 
storing German patent information and the like. In this case, a user 
interface such as a menu window. . . 

..present invention is preferably performed in the information search 
apparatus 103 of the first or second embodiment from the viewpoint of 
preventing the influence to the database server 113 . However, the 
search expression replacement function may be imparted to the database 
server. . . 

..present invention can be applied to a search apparatus which is not 
connected to a network and, more specifically, a patent information 
search system in which the terminal 101, search apparatus 103, and 
database 113 are integrated. In. . .his/her personal computer or 
workstation. 

In the above embodiments, a case wherein the Japanese patent 
database (PATOLIS) is set in Japan, and a user who is not proficient in 
Japanese accesses . . . 

.only the Japanese PATOLIS system. When the present invention is to be 
applied to a patent database in a language other than Japanese (e.g., 
the U.S. LEXPAT) , the language used for the patent database and the 
user language must be taken into consideration. For example, the 
present invention can be applied to a case wherein a user whose native 
language is not English accesses an patent database X (this database 
X is set in a country A) which uses English in the country A or.. . 

.line. 

As a further modification of the modification (Fig. 15) of the first 
embodiment, a database constructing function is imparted to the relay 
apparatus. In this case, the relay apparatus sends... 

.apparatus searches the internal database first. Only when no result is 
obtained by searching the database , the relay apparatus BendB the 
search request to the database 113. 
The network applied to. . . 

.the various embodiments as set forth discloses a computer program 
storage medium used for the patent information search system, method 



and apparatus. The medium is defined in the following items (1) to (3): 

(1) A storage medium which stores a program for controlling a patent 
information search system having a patent information search apparatus 
for searching a patent information database on the basis of an input 
search expression and a patent information processing apparatus 
connected to said patent information search apparatus through a network, 
characterized by comprising: 

a program code for replacing an input first search expression with a 
second search expression for searching the patent information database 
, on the basis of bibliographic information search contents of the search 
expression; 

a program code for transmitting the replaced second search 
expression to said patent information search apparatus; 

a program code for searching the patent information database on the 
basis of the received second search expression; and 

a program code for transmitting a search result to said patent 
information processing apparatus. 

(2) A storage medium which stores a program for patent information 
search for searching a predetermined patent information database 
on the basis of an input search expression and outputting a search 
result, characterized by comprising: 

a program code for replacing an input first search expression with a 
second search expression for searching the patent information database 
, on the basis of bibliographic information search contents of the search 
expression; 

(3) A storage. . . 

. .by comprising : 

a program code for replacing an input first search expression with a 
second search expression for searching the patent information database 
, on the basis of bibliographic information search contents of the search 
expression; 

a program code . . . 

..database through the network; and 

a program code for receiving a search result by the patent 
information database and outputting the search result. 
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...ABSTRACT search and display files based on the user's operations of the 
terminals, a file database for accumulating the accessible files 
either within or outside of the information terminals, and a key object 

...display window in order to search for other related files, wherein the 
above key object extracting means searches for and retrieves, from the 
above file database , files containing a key object group extracted 
from the above information input and display window and a key object 
group possessing similarity. . . 

...SPECIFICATION And various independent technologies have also been 
developed to provide search utilities for resources over networks . 
However, conventional search tools and interfaces necessitates "search 



key extraction /' , "selection of search field", "input using search 
protocol based on search logic", among others, as... 

..an object as operable (or active) exist as well. Due to the advancement 
of large capacity recording apparatuses and the advancement and wide 
popularization of networking , we are simply overflowing with large 
amounts of files. Consequently, the need to apply these... 

..lessening the burden from the information search operations, there is 
the invention of the Japanese Patent Application, First Publication, 
No. Hei 5-307570 "File Management Apparatus". This particular claimed 
invention has the characteristic... 

..and the usage thereof, and to perform the search-specific operations 
every time. 

The Japanese Patent Application, First Publication, No. Hei 5-81328 
"Automatic Key Word Input System" is claimed to reduce the... 
..of the claimed invention misleadingly implies. 

The claimed invention relating to the automatic key word extraction 
represented by the Japanese Patent Application, First Publication, 
No. Hei 7-230468 "Automatic Key Word Extraction and Method Therefor" 
proposes an improved method of extracting the most appropriate key word 
from. . . 

..automatic searches including this invention. 

As a method of automatically generating a search format for retrieving 
files stored in a file database , there is the Japanese Patent 
Application, First Publication, No. Hei 5-189492 "Apparatus for 
Automatic Generation of Search Format". However, this apparatus... 

.search and display files based on the user's operations of the 
terminals; 

a file database for accumulating the accessible files; and 
a key object extracting means for extracting key objects from a 
portion of or from the entire information displayed in the above... 

.display window in order to search for other related files, 

wherein the above key object extracting means searches for and 
retrieves, from the above file database , files containing a key object 
group extracted from the above information input and display window and 
a key object group possessing similarity... 

.search for other related files; 

a step for searching for and retrieving, from a file database for 
accumulating the accessible files, files containing a key object group 
extracted from the above information input and display window and a key 
object group possessing similarity ... representing the picture element 
distribution if the key objects are bitmaps. 

When the key object extracting means 10 retrieves a file from the 
file database 20, the determination of the degree of similarity of the 
key objects sent to the... 

.procedure illustrated in Fig. 3. With respect to a single file retrieved 
from the file database 20, the key object extracting means 10 first 
performs the determination of the category of the inputted object at 
process S2-1. The... user may refer to the file access histories of other 



users who share the file database or even freely take advantage of 
the accessed information. Moreover, by simultaneously displaying 
multistaged search results such as a... 

.CLAIMS search and display files based on said user's operations of the 
terminals; 

a file database for accumulating accessible files; and 
a key object extracting means for extracting key objects from at 
least a portion of the information displayed in said information 
input and display window in order to search for related files, 

wherein said key object extracting means searches for and 
retrieves, from said file database , files containing said key 
object group extracted from said information input and display 
window and a key object group possessing similarity greater. . . 

.search for other related files; 

a step for searching for and retrieving, from a file database for 
accumulating the accessible files, files containing a key object 
group extracted from said information input and display window and 
a key object group possessing similarity greater. . . 
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Detailed Description 

Claims 

English Abstract 

...Through retrieving compliant HTML pages, a search engine forwards 
retrieved pages to an object oriented database which sorts received 
information by the information's internal organization structure. By 
searching the information as stored in... 

Detailed Description 

A SYSTEM, METHOD, AND MEDIUM FOR RETRIEVING , ORGANISING, AND UTILIZING 
NETWORKED DATA The following description includes some copyrighted 
material* While Applicants do not object to the copying of this 
specification for patent related... 

. . .present invention. 

BACKGROUND OF THE INVENTION 
I . Technical Overview 

The present invention relates to a networked , information gathering 
and delivery system. More specifically, the present invention relates to 
a network based (intranet-based. . .categorized information is hereinafter 
referred to as a "portal" as the portal acts as a primary interface to 
the organized information residing on the network of computers. When a 
portal is located on a user's local site, the user... 



..Also, for simplicity, the network of computers is referred to as the Web 
or the Internet . However, it is understood that any networked group 



of computers storing 5 organized information is included within the 
scope of the invention. 

The various users of ... resources available to the players involved. 
Portal 201 includes a Web site server 202, an online page generator 204 
(also known as a protocol translator ), a searching system 205 (referred 
to herein as a Web 
9 

Crawler), search engines 203 .. .qualifiers, the system attempts to find 
the desired rectifier with the most important values satisfied first . 

The web site server 202 converts the received results from search 
engine 203 into HTML and forwards or "serves" the created page to... 

..lacks the capability or desire to support organized pages locally) 
retrieves pages 307 into an Internet browser (software running on user 
304 's computer which translates HTML pages and displays the pages on 
user 304* s display). 

Pages 307 include at ontologies. These ontologies are translated into 
an HTNE form and sent to user 304* s Internet Browser 305 as template 
pages 307. User 304 next populates the received template pages 307... 

..populated template pages 306, then user 304 forwards the populated 
template pages 306 as formatted web pages 308 to ISP 303 for storage 
. Next, web crawler 3 10 scans ISP for complying pages 308 and retrieves 
them as... 105 which has the capability to publish its own 
protocolcompliant pages. The supplier 105' s site includes a desktop 
framework 401, an Internet browser 305, protocol translator 402, web 
server 403, legacy applications 405, and legacy databases 406. An 
operator at desktop. . . 

..resulting page 404 may be transferred to web server 403 for posting. On 
request, the compiled pages 404 may be transferred to other sites 
(for example, users' portals 102) as pages 410. Alternately, the protocol 
translator may map data... may act like server 403 in Figure 4. To this 
end, legacy applications 507, protocol translator 505, legacy database 

506 may function similarly to that of the corresponding elements in 
Figure 4 . 

Tool suite. .. tracking component 608 are a reverse index database (RIDB) 
61 1 and an object oriented database (00DB) 6 10. Crawler 609 may 
receive pages from a number of sources including a supplier's web 
site 601, a sector portal web site 616 (publishing pages and/or 
ontologies 634), or a private project web site 606. Crawler 609 
parses the received pages into character strings and stores the strings 
in RIDB 611 where RIDB 611 indicates... 

. .ontology tracking component may store the found protocol data in 0ODB 6 
1 0. In storing the found protocol data in 00DB 6 10, ontology 
tracking component 608 may control the 00DB 610 to store the... or 
elemental in nature as that it is combined with other information stored 
in the 00DB (other atoms I 0 or elements, collectively "items") and 
provided to a user. The user requests the items from portal 607 and 
receives data 625. Project knowledge base manager 604 receives data 625 
and stores the data in database 605. Here, the received data 625 may 
be cataloged according to current projects a user is working. Also, the 
database 605 may also contain catalogs of items. For example, a user may 
want to build a device that requires a first component and a second 



component. The user searches via a search query (Boolean, hierarchical, 
parametric. . . 

. . .base manager may store the information relating to items A and B 
together in a first catalog in database 605 and items C and D 
together in a second catalog in database 605. Accordingly, when the 
user needs to evaluate or try a different item (part, process. . .use in 
the creation of end items; while at least one goal of the project 
database is to store the created end items (products or processes) as 
they evolve. 

Once a user completes an. . . 

...616) and for information (627) from internal information providers 
(606) . 

2) the portal stores the received information; 

3) the portal dynamically creates web pages for a user based on the 
stored 

information; and, 

4) provides links to the additional... 

...tools may autonomously access the information stored in the portal 607 
or stored in supplier web sites 601 to compile and assist users in 
defining end items. 



Claim 

I A system for organizing information comprising: 

a data source storing data including organizational terms; 

a network connected to said data source; 

a portal connected to said network for retrieving the data... 
...21 

. The system according to claim 8, wherein said organizational manager 
dynamically organizes said data based an ontology developed from the 
received organizational terms. IL The system according to claim 1, 
wherein the organizational terms include at... 

...descriptor includes a method identifier. 

15 A system for organizing information comprising: 

a data source storing data including organizational terms; 

a network connected to said data source; 

a portal connected to said network for retrieving the data... 

. . .video display. 
22 

. The system according to claim 16, wherein said portal further 
comprises: an extraction device for extracting the categorized data 
from said storage device. 

21 The system according to claim 15, wherein said portal further 
comprises: an extraction device for extracting the categorized data 
from a storage device, wherein said extraction device extracts the 
categorized data based on Boolean search criteria. 

22 The system according to claim 15, wherein said portal further 



comprises: an extraction device for extracting the categorized data 
from a storage device, wherein said extraction device extracts the 
categorized data based on hierarchical search criteria. 

23 The system according to claim 15, wherein said portal further 
comprises: an extraction device for extracting the categorized data 
from a storage device, wherein said extraction device extracts the 
categorized data based on parametric search criteria. 5 24. The 
system according to claim 20, wherein said storage... 

. .of class, method, and attribute criteria. 

25 The system according to claim 20, wherein said storage device 
receives said data from a reverse index database . 

26 The system according to claim 20, further comprising: 

an organizational manager for organizing said data based on a 
previously stored ontology of the organizational terms for storage in 
said storage device. 

27 The system according to claim 20, further comprising: 

an organizational manager for dynamically organizing said data based 

an ontology developed from the received organizational terms. 

23 

. The system according to claim 15, wherein the organizational terms 
include at . . . 

..according to claim 32, wherein said categorizing step further includes 
the step of 

organizing the received data based on the organizational terms. 

36 The method according to claim 35, wherein said organizing step... 
. .terms . 

37 The method according to claim 35, wherein said organizing step 
dynamically organizes said data based an ontology developed from the 
received organizational terms, 

38 The method according to claim 33, wherein the organizational terms 
include at... a storage device. 

25 

. The method according to claim 43, further comprising the step of 
extracting the categorized data from said storage device. 

45 The method according to claim 43, further comprising the step of 
extracting the categorized data from said storage device based on 

Boolean search criteria. 

46 The method according to claim 43, further comprising the step of 
extracting the categorized data from said storage device based on 

hierarchical search criteria. 

47 The method according to claim 43, further comprising the step of 
extracting the categorized data from said storage device based on. 

parametric search criteria. 



48 The method according to claim 43, wherein said. 



...of class, method, and attribute criteria. 

49 The method according to claim 44, wherein said storing step 
receives said data from a reverse index database . 

50 The method according to claim 44, further comprising the step of 
organizing said data based on a previously stored ontology of the 
organizational terms for storage in said storage device. 

51 The method according to claim 44, further comprising the step of 
dynamically organizing said data based an ontology developed from the 

received organizational terms. 

52 The method according to claim 42, wherein the organizational terms 
include at. . . 
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Detailed Description 

... case, CD-ROMs have been used to export 
databases to multiple users so that information storage and 

retrieval takes place at the user site , 
In the U,S. Patent No, 5,010,571 to Ron Katznelson and 

the U,S, Patents Nos, 4,827 provide a method and apparatus for retrieving 
secure 

information from a mass storage medium at a user Bite which 
is not susceptible to attack or compromise by a user. 

It is a further. . . 
. . .of the present invention to 
provide a system for retrieving secure information from a 
mass storage medium at a user site which does not require a 
reconfiguration of a personal computer at the user site, 
These... on the computer screen, print out a hard copy 
and/or transmit a copy by LAN or modem to another 
In accordance with the SCSI standard, the SCSI bus 
extends up to twenty-six feet... 
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. . . has patent extract storing unit that extracts predetermined data 
e.g. application data, detailed summary, drawing data to automatically 
generate a patent extract , and registers extract into patent 
database 

...Abstract (Basic): NOVELTY - A patent extract storing unit (103) 
extracts predetermined data e.g. application data, detailed summary, 
drawing data to automatically generate a patent extract . The 
patent extract is then registered into the patent database . 
DETAILED DESCRIPTION - A patent document storing unit (102) 
classifies every documented application, detailed statements, detailed 
summary, and detailed drawing of the patent document input into a 
terminal equipment, and stores the data into a patent database (104 



...ADVANTAGE - Reduces processing burden. Reduces time required for loading 
and observing search document since amount of documents that needs to 
be confirmed are reduced. DESCRIPTION OF DRAWING (S) - The figure shows 
the structural diagram of the automatic patent - extracting 
production system. (102) Patent document storing unit; (103) 
Patent extract storing unit; (104) Patent database . 
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.Abstract (Basic) : The data copyright management system has a database 
and a key control centre for managing copyright in the event that a 



primary user edits a copyrighted primary data. The latter is... 

...centre sends the primary use permit key to the primary user upon receipt 
of a request for distribution of the primary key from the primary 
user who requires use of the data. The primary user decrypts the 
copyrighted primary data to plain- text using the received primary 
use key for primary utilisation of the data... 
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PROBLEM TO BE SOLVED: To shorten the network administration time by 
updating the configuration of a target device according to configuration 
data . 

SOLUTION: A hypertext transfer protocol (HTTP) server receives a URL - 
encoded request from a browser 83 and processes it. A URL in it specifies 
one CGI script. . . 

. . . execute the CGI script so as to dynamically generate a response to the 
URL-encoded request The response sent to the browser 83 visually 
presents the status and configuration of the target device. The CGI script 
specified by the URL-encoded request is executed so as to update the 
status and configuration of the target device and then the HTTP server 103 
changes entries of the status and configuration in a data base 105 
regarding the device. 
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ABSTRACT 



PROBLEM TO BE SOLVED: To improve a reproduction rate while the precision of 

retrieval is highly maintained. 
SOLUTION: A computer 1 reads patent literature (primary information) from 
a data base 3, extracts secondary information on a bibliographical 
item and an index word and stores them in an auxiliary storage device 
7. A user inputs a retrieval expression by using a keyboard 4 and a 
pointing device 5 while he views a... 

. . . the coappearance of the index word is regulated by an OR approximate 
operator in the retrieval expression, the computer 1 refers the inputted 
index word with the index word extracted from. 



36/3,X/155 (Item 155 from file: 347) 

DIALOG (R) File 347:JAPI0 

(c) 2005 JPO & JAPIO. All rts. reserv. 



06341266 **Image available** 
INTERNET DOCUMENT RETRIEVAL ENGINE AND CONTROL METHOD THEREOF 

PUB, NO.: 11-282870 [JP 11282870 A] 

PUBLISHED: October 15, 1999 (19991015) 

INVENTOR (s): KIKUCHI SHINJI 

APPLICANT (s) : NEC CORP 

APPL. NO. : 10-087035 [JP 9887035] 

FILED: March 31, 1998 (19980331) 

INTERNET DOCUMENT RETRIEVAL ENGINE AND CONTROL METHOD THEREOF 

INTL CLASS: G06F-017/30 

ABSTRACT 

. . . between different types of databases is realized while reducing the 
concentration of access frequency, since URL information is acquired by 
registering data in plural types of DBMS lOh based on the vocabulary to be 
included in Internet document collected by a retrieval robot 10a and 
simultaneously extracting the registered data from the DBMS lOh as 
corresponding type according to an inquiry character string. 
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ABSTRACT 



PROBLEM TO BE SOLVED: To retrieve a Web page with the same mark without 
retrieving a key word nor generating a link directory when the Web page 
is retrieved . 

SOLUTION: A mark ID etc., is implanted in a specified mark image by 
responding a mark request from a WWW ( world wide web ) server 102, 
information on the Web page corresponding to the mark ID is registered in 

... server 103. The mark is attached to the created Web page and registered 
in a Web page DB 3021 by the server 102. The Web page is acquired 
from the server 102, the information implanted in the mark is read and a 
request for retrieval by the mark is issued to the server 103 by a 
client terminal 101. Information... 

...to the specified mark ID is transmitted to the terminal 101 by referring 
to the DB 2021, by the server 103. 
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ABSTRACT 

. . .means deriving data record in accordance with a derivation rule obtained 
in a derivation rule retrieval means. 

SOLUTION: A data read means 10 receives a data derivation command and reads 
one data record from a database 60. The data record which is read is sent 
to the derivation rule retrieval means 20. The derivation rule retrieval 
means 20 retrieves a derivation rule group 50 in accordance with the 
content of the sent data record. . . 

...record which is read by the data read means 10 are sent to the data 
derivation means 30 from the derivation rule retrieval means 20. The 

data retrieval means 30 generates the derived data record from the 
original data record in accordance with the description of the obtained 
derivation rule and sends the generated derivation data record to a 
data - base write means 40. 
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ABSTRACT 

...in accordance with the secular change of the PI. Then PI to be published 
is retrieved by referring to the reference data and history data 
corresponding to the retrieved PI are referred to. Then PI having history 
changed after the preceding segmentation time is extracted while 
determining edition processing. For instance, a patent information 
edition file 40 storing , a patent data base 10 and preceding 
edition contents related to patent information is included in the storage 
means . 
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ABSTRACT 

...BE SOLVED: To accurately grasp the gist of an invention and to improve 
the keyword retrieval efficiency by mechanically extracting keyword data 
as it is from the range, etc., of a... 

...of Japanese syllabary), signs and Roman latters, etc., is extracted from 
the document information of patent application such as the range, etc., 
of the patent demand which is stored in the external storage means 3 
and second keyword data consisting of the character strings of 
HIRAKANA( cursive form of Japanese syllabary) is extracted . Then, third 
keyword data consisting of specified KANJI is obtained in a non- 
extraction state . 
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Abstract (Basic) : 

. . . A method for analyzing and using intellectual property 

information and a system thereof are provided to mutually exchange 
views on information on intellectual property by accessing to 
intellectual property information database on line through a period 
and a keyword fixed, and by providing the information to research 
workers and persons in charge. 

An on-line intellectual property information database (100) 
includes information on all sorts of intellectual properties . An 
intellectual property information sampling unit (200) includes a 
front page sampling unit (210), a data conversion unit... 

...provided from the front page sampling unit (210) and provides the data 
converted to an intellectual property information analyzing 
unit (300). The specialized information sampling unit (230) fetches image 
information and provides the information to the intellectual 
property analyzing unit (300). An e-mail transmitting/receiving 
unit (400) transmits the information provided from the intellectual 
property information analyzing unit (300). A research worker 
analyzing unit (500) analyzes and classifies the information provided 
from the intellectual property information analyzing unit (300) via 
the e-mail transmitting/receiving unit (400... 
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database. DETAILED DESCRIPTION - A patent document storing unit 
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classifies every documented application, detailed statements, 
detailed summary, 
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USE - For automatically generating document of particular patent. 
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CLAIMS 



[Claim(s)] 

[Claim 1] The patent extract creation automatic creation method characterized by having a means to store in a 
database the patent extract which extracted information from the application which constitutes said patent 
document, the epitome, and the drawing, carried out automatic creation and this created the patent extract on the 
occasion of bag creation processing of a patent document. 

[Claim 2] A patent document storing means to classify for every document of an application, a specification, an 
epitome, and a drawing from the patent document inputted into the terminal unit on the occasion of package 
creation of a patent document, and to store in a patent bag database, The patent extract creation automatic 
creation method characterized by having a patent extract storing means to extract predetermined information 
from the application outputted from said patent document storing means, an epitome, and drawing information, 
to generate a patent extract automatically, and to register said generated patent extract to said patent bag 
database. 

[Claim 3] (a) The patent document storing processing which classifies for every document of an application, a 
specification, an epitome, and a drawing from the inputted patent document, and is stored in a patent bag 
database, And extract predetermined information from the application outputted from the (b) aforementioned 
patent document storing means, an epitome, and drawing information, and a patent extract is generated 
automatically. The record medium which recorded the program which performs each processing of the above 
(a) of processing ** which registers said generated patent extract into said patent bag database, and (b) by 
computer. 



[Translation done.] 
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DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] About the method which generates a document automatically, especially, this invention 

is used for automatic creation of a patent extract, and relates to a suitable method. 

[0002] 

[Description of the Prior Art] Conventionally, in management by the electronization of a patent document, the 
whole patent document (an application, a specification, a drawing, epitome) was stored in file equipment, and 
retrieval processing was performed about the whole patent document. For this reason, the time amount which 
retrieval processing of a patent document takes with increase of the patent document kept increases. 
[0003] Although that check to which application of the same kind is not carried out is performed in case bag 
processing is performed and subsequent patent application is performed in order to manage a patent document 
in patent application operating processing in a company etc., when retrieval processing takes a load and time 
amount, in having searched the patent application specification whole sentence at this time, the activity man day 
which visual inspection of a specification and a drawing takes also increases. In addition, although an abstract 
etc. can be obtained about the patent document after application public presentation, for example using service 
of patent information retrieval system, such as PATOLIS, information is not acquired about the patent 
application before application public presentation. 
[0004] 

[Problem(s) to be Solved by the Invention] It described above, and as carried out, in the conventional approach, 
it has the trouble of the following publication. 

[0005] The 1st trouble is that the retrieval processing time in retrieval processing of a patent document 
increases. 

[0006] The reason is for the time amount and the load which processing takes to increase in order to perform 
retrieval processing to the whole document for retrieval. 

[0007] The 2nd trouble is that the load and activity man day of visual inspection business of a retrieval 
relevance document increase. 

[0008] The reason is for having to check the whole document, when checking the document applicable to 
retrieval visually. 

[0009] Therefore, this invention is made in view of the above-mentioned trouble, and the purpose is in offering 
the patent extract automatic generation method which mitigates the load of retrieval processing and visual 
inspection processing by carrying put automatic creation and keeping an extract from a patent document, at the 
time of bag creation processing. 
[0010] 

[Means for Solving the Problem] In order to attain said purpose, this invention creates the patent extract of a 
predetermined format at the time of patent bag processing, stores the created patent extract in it, and is made to 
search from an application, a drawing, and an epitome at it to the this stored extract. 
[0011] 

[Embodiment of the Invention] The gestalt of operation of this invention is explained. The patent extract 
creation automatic creation method of this invention is set in the gestalt of the desirable operation. A patent 
document storing means to classify for every documents, such as a document which indicated an application, 
the specification, the epitome, the drawing, the chemical formula, etc. from the patent document inputted into 
the terminal unit (101 of drawing 1 ), and to store in a patent bag database (104 of drawing 1 ) (102 of drawing 
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I ), Predetermined information is extracted from the application outputted from said patent document storing 
means, an epitome, and drawing information, a patent extract is generated automatically, and it has a patent 
extract storing means (103 of drawing 1 ) to register said generated patent extract to said patent bag database. In 
addition, each above-mentioned means 102 and 103 are realizable with the program executed by computer 
which constitutes a terminal unit. 

[0012] In the gestalt of operation of this invention, at the time of patent bag processing, automatic creation of a 
patent extract is performed from an application, a drawing, and an epitome, it is made to perform storing in a 
database, and mitigation of the load at the time of retrieval and compaction of the retrieval processing time can 
be aimed at by searching the created patent extract 

[0013] Moreover, since the contents can be simply checked in case the searched patent document is checked 

with a help, mitigation of an operator's load can be aimed at. 

[0014] 

[Example] The gestalt of operation of above-mentioned this invention is explained below with reference to a 
drawing about one example of this invention that it should explain to a detail further. Drawing 1 is drawing 
showing the configuration of one example of this invention. If drawing 1 is referred to, one example of this 
invention will be equipped with the terminal application 101, the drive equipment (FDD) of a floppy disk 105, 
the patent document storing task (program) 102, the patent extract storing task (program) 103, and the patent 
bag database 104, and will be constituted, and concurrent access of the patent bag database 104 will be made it 
is desirable and possible. 

[0015] Drawing 2 is drawing for explaining actuation of one example of this invention. Actuation of one 
example of this invention is explained with reference to drawing 2 R> 2. 

[0016] Reference of drawing 2 inputs the input of the patent document stored in the floppy disk 105 from the 
floppy disk drive unit connected to the terminal 101 (** of drawing 2 R> 2). A patent document consists of a 
document which indicated an application, the specification, the epitome, the drawing, or the chemical formula. 
[0017] The patent document storing task 102 starts after a patent document input (** of drawing 2 ). 
[0018] In the patent document storing task 102, division of the document unit of a patent document, a character 
code, etc. are changed, and it outputs to a file 106 (** of drawing 2 ). 

[0019] In the patent extract creation task 103, from the document generated by the patent document storing task 
102, an application, an epitome, and a drawing are read and a patent extract is generated. A patent extract can be 
constituted here including the application date extracted from the application, the name of invention, an artificer 
name, an applicant, the technical problem extracted from the epitome, a solution means, the drawing 
information on a selection Fig., etc. 

[0020] The patent document drawn up by the patent document storing task 102 is stored in the patent bag 
database 104 (** of drawing 2 R> 2). 

[0021] The patent extract generated by the patent extract creation task 103 is stored in the patent bag database 
104 f*» of drawing 2 V 

[0022] As for the terminal application 101, the patent document storing task 102, and the patent extract storing 
task 103, it is desirable to use it by dialogic operation, 

[0023] Moreover, in the patent bag database 103, it is supposed that it is more nearly accessible than a terminal 
101. 

[0024] In this example, in order for retrieval of a patent document to make applicable to retrieval the patent 
extract stored in the patent bag database 104 from a terminal 101, it accelerates retrieval time. Moreover, in case 
application of the same class checks whether there is, for example in selection of patent application etc. in 
already make, in having searched the patent application specification whole sentence, retrieval processing takes 
a load and time amount, but according to this example, since the amount of documents for a check is cut down, 
suppression reduction of the increase of the activity man day which visual inspection takes is carried out. 
[0025] In addition, although the input of the patent document stored in the floppy disk 105 is inputted from the 
floppy disk drive unit connected to the terminal 101 in the above-mentioned example, of course, you may make 
it input a patent document into a terminal through a network. 
[0026] 

[Effect of the Invention] As explained above, according to this invention, the effectiveness of the following 
publication is done so. 

[0027] The 1st effectiveness of this invention is being able to reduce the burden and time amount which 
retrieval processing takes in retrieval processing of a patent document. 
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[0028] The reason is because the extract generated automatically by extracting information from a patent 
document is made applicable to retrieval in retrieval of a patent document in this invention. 
[0029] The 2nd effectiveness of this invention is mitigable [ the load and time amount of visual business of the 
document for retrieval ]. 

[0030] The reason is because the amount of documents for a check is cut down in this invention. 



[Translation done.] 
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* NOTICES ♦ 

JPO and NCI PI are not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer. So the translation may not reflect the original precisely. 

2. **** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



TECHNICAL FIELD 



[Field of the Invention] About the method which generates a document automatically, especially, this invention 
is used for automatic creation of a patent extract, and relates to a suitable method. 



[Translation done.] 
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JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer. So the translation may not reflect the original precisely. 

2. **** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



PRIOR ART 



[Description of the Prior Art] Conventionally, in management by the electronization of a patent document, the 
whole patent document (an application, a specification, a drawing, epitome) was stored in file equipment, and 
retrieval processing was performed about the whole patent document. For this reason, the time amount which 
retrieval processing of a patent document takes with increase of the patent document kept increases. 
[0003] Although that check to which application of the same kind is not carried out is performed in case bag 
processing is performed and subsequent patent application is performed in order to manage a patent document 
in patent application operating processing in a company etc., when retrieval processing takes a load and time 
amount, in having searched the patent application specification whole sentence at this time, the activity man day 
which visual inspection of a specification and a drawing takes also increases. In addition, although an abstract 
etc. can be obtained about the patent document after application public presentation, for example using service 
of patent information retrieval system, such as PATOLIS, information is not acquired about the patent 
application before application public presentation. 



[Translation done.] 
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* NOTICES ♦ 

JPO and NCI PI are not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer. So the translation may not reflect the original precisely. 

2. **** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



EFFECT OF THE INVENTION 



[Effect of the Invention] As explained above, according to this invention, the effectiveness of the following 
publication is done so. 

[0027] The 1st effectiveness of this invention is being able to reduce the burden and time amount which 
retrieval processing takes in retrieval processing of a patent document. 

[0028] The reason is because the extract generated automatically by extracting information from a patent 
document is made applicable to retrieval in retrieval of a patent document in this invention. 
[0029] The 2nd effectiveness of this invention is mitigable [ the load and time amount of visual business of the 
document for retrieval ]. 

[0030] The reason is because the amount of documents for a check is cut down in this invention. 



[Translation done.] 
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* NOTICES * 

JPO and NCI PI are not responsible for any 
damages caused by the use of this translation. 

1 .This document has been translated by computer. So the translation may not reflect the original precisely. 
2.**** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



TECHNICAL PROBLEM 



(Problem(s) to be Solved by the Invention] It described above, and as carried out, in the conventional approach, 
it has the trouble of the following publication. 

[0005] The 1st trouble is that the retrieval processing time in retrieval processing of a patent document 
increases. 

[0006] The reason is for the time amount and the load which processing takes to increase in order to perform 
retrieval processing to the whole document for retrieval. 

[0007] The 2nd trouble is that the load and activity man day of visual inspection business of a retrieval 
relevance document increase. 

[0008] The reason is for having to check the whole document, when checking the document applicable to 
retrieval visually. 

[0009] Therefore, this invention is made in view of the above-mentioned trouble, and the purpose is in offering 
the patent extract automatic generation method which mitigates the load of retrieval processing and visual 
inspection processing by carrying out automatic creation and keeping an extract from a patent document, at the 
time of bag creation processing. 



[Translation done.] 
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JFO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1 .This document has been translated by computer. So the translation may not reflect the original precisely. 
2.**** shows the word which can not be translated. 
3.1n the drawings, any words are not translated. 



MEANS 



[Means for Solving the Problem] In order to attain said purpose, this invention creates the patent extract of a 
predetermined format at the time of patent bag processing, stores the created patent extract in it, and is made to 
search from an application, a drawing, and an epitome at it to the this stored extract. 
[0011] 

[Embodiment of the Invention] The gestalt of operation of this invention is explained. The patent extract 
creation automatic creation method of this invention is set in the gestalt of the desirable operation. A patent 
document storing means to classify for every documents, such as a document which indicated an application, 
the specification, the epitome, the drawing, the chemical formula, etc. from the patent document inputted into 
the terminal unit (101 of drawing 1 ), and to store in a patent bag database (104 of drawing 1 ) (102 of drawing 
I ), Predetermined information is extracted from the application outputted from said patent document storing 
means, an epitome, and drawing information, a patent extract is generated automatically, and it has a patent 
extract storing means (103 of drawing 1 ) to register said generated patent extract to said patent bag database. In 
addition, each above-mentioned means 102 and 103 are realizable with the program executed by computer 
which constitutes a terminal unit. 

[0012] In the gestalt of operation of this invention, at the time of patent bag processing, automatic creation of a 
patent extract is performed from an application, a drawing, and an epitome, it is made to perform storing in a 
database, and mitigation of the load at the time of retrieval and compaction of the retrieval processing time can 
be aimed at by searching the created patent extract. 

[0013] Moreover, since the contents can be simply checked in case the searched patent document is checked 
with a help, mitigation of an operator's load can be aimed at. 
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* NOTICES * 

JPO and NCI PI are not responsible for any 
damages caused by the use of this translation. 

1 .This document has been translated by computer. So the translation may not reflect the original precisely. 
2.**** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



EXAMPLE 



[Example] The gestalt of operation of above-mentioned this invention is explained below with reference to a 
drawing about one example of this invention that it should explain trf a detail further. Drawing 1 is drawing 
showing the configuration of one example of this invention. If drawing 1 is referred to, one example of this 
invention will be equipped with the terminal application 101, the drive equipment (FDD) of a floppy disk 105, 
the patent document storing task (program) 102, the patent extract storing task (program) 103, and the patent 
bag database 104, and will be constituted, and concurrent access of the patent bag database 104 will be made it 
is desirable and possible. 

[0015] Drawing 2 is drawing for explaining actuation of one example of this invention. Actuation of one 
example of this invention is explained with reference to drawing 2 R> 2. 

[0016] Reference of drawing 2 inputs the input of the patent document stored in the floppy disk 105 from the 
floppy disk drive unit connected to the terminal 101 (** of drawing 2 R> 2). A patent document consists of a 
document which indicated an application, the specification, the epitome, the drawing, or the chemical formula. 
[0017] The patent document storing task 102 starts after a patent document input (** of drawing 2 ). 
[0018] In the patent document storing task 102, division of the document unit of a patent document, a character 
code, etc. are changed, and it outputs to a file 106 (** of drawing 2 ). 

[0019] In the patent extract creation task 103, from the document generated by the patent document storing task 
102, an application, an epitome, and a drawing are read and a patent extract is generated. A patent extract can be 
constituted here including the application date extracted from the application, the name of invention, an artificer 
name, an applicant, the technical problem extracted from the epitome, a solution means, the drawing 
information on a selection Fig., etc. 

[0020] The patent document drawn up by the patent document storing task 102 is stored in the patent bag 
database 104 (** of drawing 2 R> 2). 

[0021] The patent extract generated by the patent extract creation task 103 is stored in the patent bag database 
104 (*» of drawing 2 ). 

[0022] As for the terminal application 101, the patent document storing task 102, and the patent extract storing 
task 103, it is desirable to use it by dialogic operation. 

[0023] Moreover, in the patent bag database 103, it is supposed that it is more nearly accessible than a terminal 
101. 

[0024] In this example, in order for retrieval of a patent document to make applicable to retrieval the patent 
extract stored in the patent bag database 104 from a terminal 101, it accelerates retrieval time. Moreover, in case 
application of the same class checks whether there is, for example in selection of patent application etc. in 
already make, in having searched the patent application specification whole sentence, retrieval processing takes 
a load and time amount, but according to this example, since the amount of documents for a check is cut down, 
suppression reduction of the increase of the activity man day which visual inspection takes is carried out. 
[0025] In addition, although the input of the patent document stored in the floppy disk 105 is inputted from the 
floppy disk drive unit connected to the terminal 101 in the above-mentioned example, of course, you may make 
it input a patent document into a terminal through a network. 



[Translation done.] 
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DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawing 1] It is drawing showing the configuration of one example of this invention. 
[Drawing 2] It is drawing for explaining actuation of one example of this invention. 
[Description of Notations] 

101 Terminal Application 

102 Patent Document Storing Task 

103 Patent Extract Storing Task 

1 04 Patent Bag Database 

105 Floppy Disk (FD) 
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JPO and NCIPI are not responsible for any 
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1. This document has been translated by computer. So the translation may not reflect the original 
precisely. 

2. **** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



CLAIMS 



[Claim(s)] 

[Claim 1] The patent extract creation automatic creation method characterized by having a means to 
store in a database the patent extract which extracted information from the application which constitutes 
said patent document, the epitome, and the drawing, carried out automatic creation and this created the 
patent extract on the occasion of bag creation processing of a patent document 
[Claim 2] A patent document storing means to classify for every document of an application, a 
specification, an epitome, and a drawing from the patent document inputted into the terminal unit on the 
occasion of package creation of a patent document, and to store in a patent bag database, The patent 
extract creation automatic creation method characterized by having a patent extract storing means to 
extract predetermined information from the application outputted from said patent document storing 
means, an epitome, and drawing information, to generate a patent extract automatically, and to register 
said generated patent extract to said patent bag database. 

[Claim 3] (a) The patent document storing processing which classifies for every document of an 
application, a specification, an epitome, and a drawing from the inputted patent document, and is stored 
in a patent bag database, And extract predetermined information from the application outputted from the 
(b) aforementioned patent document storing means, an epitome, and drawing information, and a patent 
extract is generated automatically. The record medium which recorded the program which performs 
each processing of the above (a) of processing ** which registers said generated patent extract into said 
patent bag database, and (b) by computer. 
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sentence, retrieval processing takes a load and time amount, but according to this example, since the 
amount of documents for a check is cut down, suppression reduction of the increase of the activity man 
day which visual inspection takes is carried out. 

[0025] In addition, although the input of the patent document stored in the floppy disk 105 is inputted 
from the floppy disk drive unit connected to the terminal 101 in the above-mentioned example, of 
course, you may make it input a patent document into a terminal through a network. 
[0026] 

[Effect of the Invention] As explained above, according to this invention, the effectiveness of the 
following publication is done so. 

[0027] The 1st effectiveness of this invention is being able to reduce the burden and time amount which 
retrieval processing takes in retrieval processing of a patent document. 

[0028] The reason is because the extract generated automatically by extracting information from a patent 
document is made applicable to retrieval in retrieval of a patent document in this invention. 
[0029] The 2nd effectiveness of this invention is mitigable [ the load and time amount of visual business 
of the document for retrieval ]. 

[0030] The reason is because the amount of documents for a check is cut down in this invention. 



[Translation done.] 
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precisely* 
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TECHNICAL FIELD 



[Field of the Invention] About the method which generates a document automatically, especially, this 
invention is used for automatic creation of a patent extract, and relates to a suitable method. 
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* NOTICES ♦ 

JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1 .This document has been translated by computer. So the translation may not reflect the original 
precisely. 

2.**** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



PRIOR ART 



[Description of the Prior Art] Conventionally, in management by the electronization of a patent 
document, the whole patent document (an application, a specification, a drawing, epitome) was stored in 
file equipment, and retrieval processing was performed about the whole patent document. For this 
reason, the time amount which retrieval processing of a patent document takes with increase of the 
patent document kept increases. 

[0003] Although that check to which application of the same kind is not carried out is performed in case 
bag processing is performed and subsequent patent application is performed in order to manage a patent 
document in patent application operating processing in a company etc., when retrieval processing takes a 
load and time amount, in having searched the patent application specification whole sentence at this 
time, the activity man day which visual inspection of a specification and a drawing takes also increases. 
In addition, although an abstract etc. can be obtained about the patent document after application public 
presentation, for example using service of patent information retrieval system, such as PATOLIS, 
information is not acquired about the patent application before application public presentation. 
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EFFECT OF THE INVENTION 



[Effect of the Invention] As explained above, according to this invention, the effectiveness of the 
following publication is done so. 

[0027] The 1st effectiveness of this invention is being able to reduce the burden and time amount which 
retrieval processing takes in retrieval processing of a patent document 

[0028] The reason is because the extract generated automatically by extracting information from a patent 
document is made applicable to retrieval in retrieval of a patent document in this invention. 
[0029] The 2nd effectiveness of this invention is mitigable [ the load and time amount of visual business 
of the document for retrieval ]. 

[0030] The reason is because the amount of documents for a check is cut down in this invention. 
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* NOTICES * 

JPO and NCI PI are not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer. So the translation may not reflect the original 
precisely. 

2. **** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



TECHNICAL PROBLEM 



[Problem(s) to be Solved by the Invention] It described above, and as carried out, in the conventional 
approach, it has the trouble of the following publication. 

[0005] The 1st trouble is that the retrieval processing time in retrieval processing of a patent document 
increases. 

[0006] The reason is for the time amount and the load which processing takes to increase in order to 
perform retrieval processing to the whole document for retrieval. 

[0007] The 2nd trouble is that the load and activity man day of visual inspection business of a retrieval 
relevance document increase. 

[0008] The reason is for having to check the whole document, when checking the document applicable 
to retrieval visually. 

[0009] Therefore, this invention is made in view of the above-mentioned trouble, and the purpose is in 
offering the patent extract automatic generation method which mitigates the load of retrieval processing 
and visual inspection processing by carrying out automatic creation and keeping an extract from a patent 
document, at the time of bag creation processing. 
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1 .This document has been translated by computer. So the translation may not reflect the original 
precisely. 
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3. In the drawings, any words are not translated. 



MEANS 



[Means for Solving the Problem] In order to attain said purpose, this invention creates the patent extract 
of a predetermined format at the time of patent bag processing, stores the created patent extract in it, and 
is made to search from an application, a drawing, and an epitome at it to the this stored extract. 
[0011] 

[Embodiment of the Invention] The gestalt of operation of this invention is explained. The patent extract 
creation automatic creation method of this invention is set in the gestalt of the desirable operation. A 
patent document storing means to classify for every documents, such as a document which indicated an 
application, the specification, the epitome, the drawing, the chemical formula, etc. from the patent 
document inputted into the terminal unit (101 of drawing 1 ), and to store in a patent bag database (104 
of drawing 1 ) (102 of drawing 1 ), Predetermined information is extracted from the application 
outputted from said patent document storing means, an epitome, and drawing information, a patent 
extract is generated automatically, and it has a patent extract storing means (103 of drawing 1 ) to 
register said generated patent extract to said patent bag database. In addition, each above-mentioned 
means 102 and 103 are realizable with the program executed by computer which constitutes a terminal 
unit. 

[0012] In the gestalt of operation of this invention, at the time of patent bag processing, automatic 
creation of a patent extract is performed from an application, a drawing, and an epitome, it is made to 
perform storing in a database, and mitigation of the load at the time of retrieval and compaction of the 
retrieval processing time can be aimed at by searching the created patent extract. 
[0013] Moreover, since the contents can be simply checked in case the searched patent document is 
checked with a help, mitigation of an operator's load can be aimed at. 
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JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer. So the translation may not reflect the original 
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3. In the drawings, any words are not translated. 



EXAMPLE 



[Example] The gestalt of operation of above-mentioned this invention is explained below with reference 
to a drawing about one example of this invention that it should explain to a detail further. Drawing 1 is 
drawing showing the configuration of one example of this invention. If drawing 1 is referred to, one 
example of this invention will be equipped with the terminal application 101, the drive equipment 
(FDD) of a floppy disk 105, the patent document storing task (program) 102, the patent extract storing 
task (program) 103, and the patent bag database 104, and will be constituted, and concurrent access of 
the patent bag database 104 will be made it is desirable and possible. 

[0015] Drawing 2 is drawing for explaining actuation of one example of this invention. Actuation of one 
example of this invention is explained with reference to drawing 2 R> 2. 

[0016] Reference of drawing 2 inputs the input of the patent document stored in the floppy disk 105 
from the floppy disk drive unit connected to the terminal 101 (** of drawing 2 R> 2). A patent 
document consists of a document which indicated an application, the specification, the epitome, the 
drawing, or the chemical formula. 

[0017] The patent document storing task 102 starts after a patent document input (** of drawing 2 ). 
[0018] In the patent document storing task 102, division of the document unit of a patent document, a 
character code, etc. are changed, and it outputs to a file 106 (** of drawing 2 ). 
[0019] In the patent extract creation task 103, from the document generated by the patent document 
storing task 102, an application, an epitome, and a drawing are read and a patent extract is generated. A 
patent extract can be constituted here including the application date extracted from the application, the 
name of invention, an artificer name, an applicant, the technical problem extracted from the epitome, a 
solution means, the drawing information on a selection Fig., etc. 

[0020] The patent document drawn up by the patent document storing task 102 is stored in the patent 
bag database 104 (** of drawing 2 R> 2). 

[0021] The patent extract generated by the patent extract creation task 103 is stored in the patent bag 
database 104 (** of drawing 2 ). 

[0022] As for the terminal application 101, the patent document storing task 102, and the patent extract 
storing task 103, it is desirable to use it by dialogic operation. 

[0023] Moreover, in the patent bag database 103, it is supposed that it is more nearly accessible than a 
terminal 101. 

[0024] In this example, in order for retrieval of a patent document to make applicable to retrieval the 
patent extract stored in the patent bag database 104 from a terminal 101, it accelerates retrieval time. 
Moreover, in case application of the same class checks whether there is, for example in selection of 
patent application etc. in already make, in having searched the patent application specification whole 
sentence, retrieval processing takes a load and time amount, but according to this example, since the 
amount of documents for a check is cut down, suppression reduction of the increase of the activity man 
day which visual inspection takes is carried out. 

[0025] In addition, although the input of the patent document stored in the floppy disk 105 is inputted 
from the floppy disk drive unit connected to the terminal 101 in the above-mentioned example, of 
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course, you may make it input a patent document into a terminal through a network. 
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http://www4apdl.ncipi.gojp/cgi-bin/tran_web_cgi_ejje 



12/16/2005 



JP,1 1-085799 A [DESCRIPTION OF DRAWINGS] 



Page 1 of 1 



* NOTICES * 

JPO and HCIPI are not responsible for any 
damages caused by the use of this translation. 

1 .This document has been translated by computer. So the translation may not reflect the original 
precisely. 

2.**** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawing 1] It is drawing showing the configuration of one example of this invention. 
[Drawing 2] It is drawing for explaining actuation of one example of this invention. 
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CLAIMS : 



1. A system for providing to system users IJ? centric, multi-channel, time-shifted and real time 
telecommunication services such as live television, television on demand, video on demand, and 
karaoke on demand, comprising: a media content creator subsystem for receiving multiple video 
signal streams each having one of several industry standard communication format, and for 
converting the incoming video signal streams into digital data and compressing the digital data 
into I_P based packets, for transmission over broadband network ; a storage means for storing I£ 
based packets and permitting stored I_P based packets to be retrieved therefrom; a gateway means 
for receiving packets from multiple sources and combining such packets for transmission over a 
broadband communication network ; a media streaming subsystem for receiving, and forwarding 
streams of IP based packets, said media streaming subsystem being responsive to a user request 
and operative to forward a selected stream of I_P based packets from either said content creator 
subsystem or said storage means to said gateway means or to retrieve stored streams of IP based 
packets from said storage means and forward the retrieved IP packets to said gateway means; and 
a content management subsystem for controlling user access to the system and providing user 
account management . 

18. A method for providing TP centric, multi-channel, time-shifted and real time 
telecommunication services including live television, television on demand, video on demand, 
and karaoke on demand, said method comprising: receiving multiple video signal streams each 
having one of several industry standard communication format by a media content creator 
subsystem, and converting the incoming video signal streams into digital data and compressing 
the digital data into IJP based packets, for transmission over broadband network ; storing IJP 
based packets and permitting stored rP based packets to be retrieved from a storage means; 
receiving packets from multiple sources via a gateway means and combining such packets for 
transmission over a broadband communication network ; receiving and forwarding streams of IJP 
based packets using a media streaming subsystem being responsive to a user request and 
operative to forward a selected stream of IP based packets from either said content creator 
subsystem or said storage means to said gateway means or to retrieve stored streams of TP based 
packets from said storage means and forward the retrieved IP packets to said gateway means; and 
controlling user access to the system and providing user account management. 

35. A computer program embodied on a computer readable medium for providing TP centric, multi- 
channel, time-shifted and real time telecommunication services including live television, 
television on demand, video on demand, and karaoke on demand, comprising: a code segment for 
receiving multiple video signal streams each having one of several industry standard 
communication format by a media content creator subsystem, and converting the incoming video 
signal streams into digital data and compressing the digital data into rP based packets, for 
transmission over broadban d network ; a code segment for storing Ij? based packets and permitting 
stored Ij? based packets to be retrieved from a storage means; a code segment for receiving 
packets from multiple sources via a gateway means and combining such packets for transmission 
over a broadband communication network ; a code segment for receiving and forwarding streams of 
IP based packets using a media streaming subsystem being responsive to a user request and 
operative to forward a selected stream of IP based packets from either said content creator 
subsystem or said storage means to said gateway means or to retrieve stored streams of IJP based 
packets from said storage means and forward the retrieved IP packets to said gateway means; and 
a code segment controlling user access to the system and providing user account management. 
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With the explosive growth of the World Wide Web, the public is gaining access to massive 
amounts of information. However, locating needed and relevant information remains a 
difficult task, whether the information is textual or visual. Text search engines have 
existed for some years now and have achieved a certain degree of success. However, 
despite the large number of images available on the Web, image search engines are still 
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Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the 
execution of the application. The visualization tool we use is Poet, an event tracer 
developed at the University of Waterloo. However, these diagrams are often very complex 
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Peer-to-Peer (P2P) file sharing software has brought a hot discussion on P2P file sharing 
among all businesses. Freenet, Gnutella, and Napster are the three most popular P2P file 
sharing applications. They use three distinct protocols and these protocols come with 
different characteristics. In this paper, we discuss the protocols of these P2P file sharing 
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The Internet in its present form does not protect content from censorship. It is 
straightforward to trace any document back to a specific Web server, and usually directly 
to an individual. As we discuss below, there are valid reasons for publishing a document in 
a censorship-resistant manner. Unfortunately, few tools exist that facilitate this form of 
publishing. We describe the architecture of robust systems for publishing content on the 
Web. The discussion is in the context of Publius, as that ... 
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As online document collections continue to expand, both on the Web and in proprietary 
environments, the need for duplicate detection becomes more critical. Few users wish to 
retrieve search results consisting of sets of duplicate documents, whether identical 
duplicates or close matches. Our goal in this work is to investigate the phenomenon and 
determine one or more approaches that minimize its impact on search results. Recent 
work has focused on using some form of signature to characterize a do ... 
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Extending beyond the boundaries of science, art, and culture, content-based multimedia 
information retrieval provides new paradigms and methods for searching through the 
myriad variety of media all over the world. This survey reviews lOO&plus; recent articles 
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Digital library mediators allow interoperation between diverse information services. In this 
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